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(4) of the Deregulation and Contracting Out Act 1994, to sign and issue certificates on 
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Cooperation Treaty at the UK Receiving Office. The application was allocated the 
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In accordance with the Patents (Companies Re-registration) Rules 1982, if a company 
named in this certificate and any accompanying documents has re-registered under the 
Companies Act 1980 with the same name as that with which it was registered 
immediately before re-registration save for the substitution as, or the inclusion as, the last 
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Name, of receiving Office and "PCT International Application" 



Applicant's or agent's file reference 
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Box No. I TITLE OF INVENTION 

Receptor 




Box No. II APPLICANT [ ] This person is also inventor 


Name and address: (Family name follo\\>ed by given name; for a legal entity, full official^ 

The address must include postal code and name of country. The country of the address indicated in this 

Box is the applican t 's State ( that is, coun try) of residence if no State of residence is indicated belo w.) 

Paradigm Therapeutics Limited 
St John's Innovation Centre 
Cowiey Road 

Cambridge / 
CB4 OWS 
United Kingdom 


Telephone No. 


Facsimile No. 


Teleprinter No. 


Applicant's registration No. with the Office 


State (that is, country) of nationality: 
GB 


State (that is. country) of residence: 
GB 


This person is applicant | l all designated ail designated States except |~~] the United States | 1 the States indicated in 

for the purposes of: 1 1 States 1 ^ 1 the United States of America | | of America only | | the Supplemental Box 


Box No. ni FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 


Name and address : (Family name followed by given name; for a legal entity, Jiill official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is theapplicant 's State (that is, country) of residence if no State of residence is indicated below,) 

CARLTON, Mark 
^^^Avlar BioVentures 
Compass House 

Vision Park * 
Chi vers Way ^ 
Htston, Cambridge, CB4 9ZR 
United Kingdom 


This person is: 

I 1 applicant only 

1 X 1 applicant and inventor 

1 1 inventor only (If this check-box 
\ 1 is marked, do not fill in below.) 


Applicant's registration No. with the Office 


State (that is, country) of nationality: 
GB 


State (that is, country) of residence: 

GB' 


This person is applicant l l all designated 1 1 all designated States except FTTl the United States 1" 1 the States indicated in 
for the purposes of: 1 1 States | | the United States of Amenca 1 ^ i of America only | | the Supplemental Box 



I X I Further applicants and/or (further) inventors are indicated on a continuation sheet'. 



Box No. IV AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 



The person identified below is hereby/has' been appointed to act on behalf 
of the applicant(s) before the competent International Authorities as: 



agent 



□ common 
representative 



Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country.) 



KHOO, Chong-Yee 
D Young & Co • 
21 New Fetter Lane 
London 
EC4A 1 DA 
ENGLAND 



Telephone No, 

+44 23 8071 9500 



Facsimile No. 

+44 23 8071 9800 



Teleprinter No. 
477667 YOUNGS G 



Agent's registration No. with the Office 



□ 



Address for correspondence: Mark this check-box where no agent or common representative is/has been appointed and the 
space above is used instead to indicate a special address to which correspondence should be sent. 
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1 1 is marked, do not fill in below.) 


Applicant* s registration No . with the OfBce 


State (that is, country) of nationality: 
PT 


State (that is, country) of residence: 
GB 


This person is applicant j i all designated | 1 all designated States except f^l United States j [ the States indicated in 

for the purposes of: 1 J States L 1 the United States of America L±J of America only j | the Supplemental Box 



3? 



6> 



*4 



Continuation of Box No, m FARTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 

If none of the folloMfing sub-boxes is used, this sheet should not be included in the request. 



Name and address : (Family name followed by given name; for a legal entity, Jull official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is. country) of residence ifno State of residence is indicated below.) 



This person is: 

I I applicant only 



Name and address: (Family name fallowed by given name; for a legal entity, fill official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant 's State (that is. country) of residence ifno State of residence is indicated below.) 



DIXON, John 
^e^Avlar BioVentures 
Compass House 
Vision Parl< 
Chivers Way 

Histon, Cambridge, CB4 9ZR 
United Kingdom 


1 X 1 applicant and inventor 

1 1 inventor only (If this check-box 
1 1 is marked, do not fill in below,) 


Applicant's registration No. with the Office 
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This person is: 
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THRESHER, Rosemary 
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1 X 1 applicant and inventor 
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Applicant' s registration No . with the Office 


State (that is. country) of nationality: 
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State (that is. country) of residence: 
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ZAHN. Dirk 
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Compass House 
Vision Park 
Chivers Way 

Histon. Cambridge, CB4 9ZR 
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State (that is. country) of nationality: 
DE 



This person is: 

I I applicant only 

I X I applicant and inventor 

□ inventor only (If this check-box 
is marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of residence: 
GB 



This person is applicant 
for the purposes of: 



□ all designated | j all designated States except 
States I I — 



the United States of America 



Ethe United States 
of America only 



□ the States indicated in 
Ae Supplemental Box 



I Further applicants and/or (further) inventors are indicated on anotiier continuation sheet. 
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Box No, V DESIGNATION OF STATES Mark the applicable check-boxes belo^; at least one must be marked. 



The following designations are hereby made under Rule 4.9(a): 
Regional Patent 

12 AP ARIPO Patent: GH Ghana, GM Gambia, KE Kenya, LS Lesotho, MW Malawi, MZ Mozambique, SD Sudan, 
SL Sierra Leone, SZ Swaziland, TZ United Republic of Tanzania, UG Uganda, ZW Zimbabwe, and any other State which is 
a Contracting State of the Harare Protocol and of the PCT 

IB EA Eurasian Patent: AM Armenia, AZ Azerbaijan, BY Belarus, KG Kyrgyzstan, KZ Kazakhstan,- MD Republic of Moldova, 
RU Russian Federation, TJ Tajikistan, TM Turkmenistan, and any other State which is a Contracting State of the Eurasian 
Patent Convention and of the PCT 

H EP European Patent: AT Austria, BE Belgium, CH & LI Switzerland and Liechtenstein, CY Cyprus, DE Germany, 
DK Denmark, ES Spain, FI Finland, FR France, GB United Kingdom, GR Greece, IE Ireland, IT Italy, LU Luxembourg, 
MC Monaco, NL Netherlands, PT Portugal, SE Sweden, TR Turkey, and any other State which is a Contracting State of 
the European Patent Convention and of the PCT 

G3 OA OAPI Patent: BF Burkina Faso, BJ Benin, CF Central African Republic, CG Congo, CI Cote d'lvoire, CM Cameroon, 
G A Gabon, GN Guinea, GW Guinea-Bissau, ML Mali, MR Mauritania, NE Niger, SN Senegal, TD Chad, TG Togo, and any 
other State which is a member State of OAPI and a Contracting State of the PCT (if other kind of protection or treatment desired, 
specify on dotted line) 

National Patent (if other kind of protection or treatment desired, specify on dotted line): 

H AE United Arab Emirates 12 GH Ghana [8 MX Mexico 

E AG Antigua and Barbuda 12 GM Gambia 12 MZ Mozambique 

CB AL Albania 12 HR Croatia H NO Norway 

12 AM Armenia 12 BCU Hungaiy IS NZ New Zealand 

12 AT Austria \S TO Indonesia B PL Poland 

B AU Australia IS DL Israel. H PT Portugal 

H AZ Azerbaijan 12 IN India [2 RO Romania 

12 BA Bosnia and Herzegovina 12 IS Iceland [2 RU Russian Federation 

12 BB Barbados E! JP Japan 

[2 BG Bulgaria ES KE Kenya H SD Sudan 

12 BR Brazil , CB KG Kyrgyzstan 12 SE Sweden 

[2 BY Belarus 12 KP Democratic People's Republic Q SG Singapore 

[2 BZ Belize of Korea [2 SI Slovenia 

H CA Canada E KR Republic of Korea |2 SK Slovakia 

12 CH & LI Switzerland and Liechtenstein H KZ Kazakhstan 12 SL Sierra Leone 

12 CN China 12 LC Saint Lucia . E TJ Tajikistan 

12 CO Colombia 12 LK Sri Lanka 12 TM Turkmenistan 

12 CR Costa Rica [2 LR Liberia . H TR Turkey 

E CU Cuba B LS Lesotho |2 TT Trinidad and Tobago : 

IS CZ Czech Republic 18 LT Lithuania 

E DE Germany IH LU Luxembourg (2 TZ United Republic of Tanzania 

12 DK Denmark H LV Latvia E UA Ukraine 

12 DM Dominica !2 MA Morocco H UG Uganda 

12 DZ Algeria 12 MD Republic of Moldova |2 US United States of America 

12 EC Ecuador 

H EE Estonia [2 MG Madagascar , |2 UZ Uzbekistan 

IS ES Spain 12 MK The former Yugoslav Republic of [2 VN Viet Nam 

12 FI Finland Macedonia |2 YU Yugoslavia 

12 GB United Kingdom H MN Mongolia |2 ZA • South A&ica 

B GD Grenada (S MWMalawi 12 ZW Zimbabwe 

H GE Georgia 



Check-boxes below reserved for designating States which have become party to the PCT after issuance of this sheet: 

ca PJi PhJIiPPipes E ZiVl Zambia □ 

[g OM Orrian [g TN Tunisia |--| 



Precautionary Designation Statement: In addition to the designations made above, the applicant also makes under Rule 4.9(b) all 
other designations which would be perinitted under the PCT except any designation(s) indicated in the Supplemental Box as being 
excluded from the scope of this statement. The applicant declares that those additional designations are subject to confirmation and that . 
any designation which is not confirmed before the expiration of 1 5 months from the priority date is to be regarded as withdrawn by the 
applicant at the expiration of that time limit. (Confirmation (includingfees) must reach the receiving Office within the 15-month time limit) 
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Supplemental Box If the Supplemental Box is not used, this sheet should not be included in the request. 



J. If, in any of the Boxes, except Boxes Nos. VlUfi) to (v) for which 
a special continuation box is provided, the space is insufficient 
to fitmish all the information: in such case, write "Continuation 
of Box No.... " (indicate the number of the Box) and furnish the 
information in the same manner as required according to the 
captions of the Box in which the space was insufficient, in 
particular: 

(i) if more than two persons are to be indicated as applicants 
and/or inventors and no "continuation sheet" is available: in 
such case, write "Continuation of Box No. Ill" and indicate for 
each additional person the same type of information as required 
in Box No. III. The country of the address indicated in this Box 
is the applicant 's State (that is, country) of residence if no State 
of residence is indicated below; 

(ii) if, in Box No. II or in any of the sub-boxes of Box No. Ill, the 
indication **the States indicated in the Supplemental Box" is 
checked: in such case, write "Continuation of Box No. 11" or 
"Continuation of Box No. Ill" or "Continuation of Boxes No. II 
and No. HI" (as the case may be), indicate the name of the 
applicant(s) involved and, next to (each) such name, theState(s) 
(and/or, where applicable, ARIPO, Eurasian, European or 
OAPI patent) for the purposes of which the named person is 
applicant; 

(Hi) if, in Box No. II or in any of the sub-boxes of Box No. Ill, the 
inventor or the inventor/applicant is not inventor for the 
purposes of all designated States or for the purposes of the 
VnitedStates of America: in such case, write "Continuation of 
Box No. 11" or "Continuation of Box No. lU" or "Continuation 
of Boxes No, II and No. Ill" (as the case may be), indicate the 
name of the inventor (s) and, next to (each) such name, 
the State fs) (and/or, where applicable, ARIPO, Eurasian, 
European or OAPI patent) for the purposes of which the 
named person is inventor; 

(iv) if, in addition to the agent(s) indicated in Box No. IV, there are 
further agents: in such case, write "Continuation of 
Box No. IV'* and indicate for each further agent the same type 
of information as reqidred in Box No. IV; 

( v) if, in Box No. V, the name of any State (or OAPI) is accompan ied 
by the indication "patent of addition, " or "certificate of 
addition, " or if, in Box No. V, the name of the United States of 
America is accompanied by an indication "continuation " or 
"continuation-in-part": in such case, write "Continuation of 
Box No. V" and the name of each State involved (or OAPI), 
and after the name of each such State (or OAPI), the number of 
the parent title or parent application and the date of grant of 
the parent title or filing of the parent application; 

(vi) if, in Box No. VI, there are more than five earlier applications 
whose priority is claimed: in sitch case, write "Continuation 
of Box No. VI" and indicate for each additional earlier 
application the same type of information as required 
in Box No. VI. 



Continuation of Box No. IV 

PILCH, Adam John Michael 
CRISP, David Norman 
ROBINSON, Nigel Alexander Julian 
HARRIS. Ian Richard 
HARDING, Charles Thomas 
TURNER, James Arthur 
MALLALIEU, Catherine Louise 
PRATT, Richard Wilson 
PRICE, Paul Anthony King 
HORNER, David Richard 
MASCHIO, Antonio 
NACHSHEN, Neil 
POTTER, Julian Mark 
HAINES, Miles John 
DEVILE, Jonathan Mark 
TANNER, James Percival 
KHOO, Chong-Yee 
HOLLIDAY. Louise Caroline 
MILLS, Julia 
HECTOR, Annabel 
ALCOCK. David 
DAVIES, Simon 
DENHOLM, Anna 
MAIN, Malcolm 



2. If, with regard to the precautionary designation statement 
contained in Box No. V, the applicant wishes to exclude any 
State (s) from the scope of that statement: in such case, write 
"Designation(s) excluded from precautionary designation 
statement" and indicate the name or two-letter code of each 
State so excluded. 
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Box No. VI PRIORITY CLAIM 



The priority of the following earlier appIication(s) is hereby claimed: 



Filing date 
of earlier application 

(day/month/year) 



iteni(l) 

^ , 16/5/2001 



item (2) 

. ^18/5/2001 



item (3) 



item (4) 



item (5) 



Number 
of earlier application 



0111959.3 



60/292,141 



Where earlier application is: 



national application: 
comitry 



GB 



US 



regional application:* 
regional Office 



international application; 
receiving Office 



[~1 Further priority claims are indicated in the Supplemental Box. 



The receiving Office is requested to prepare and transmit to the International Bureau a certified copy of the earlier application(s) (only 
if the earlier application was filed with the Office which for the purposes of this international application is the receiving Office) identified 
above as: 

□ all items [X] item (1) □ item (2) □ item (3) □ item (4) □ item (5) □ supplemental Box 

* Where the earlier application is an ARIPO application, indicate at least one country party to the Paris Convention for the Protection of 
Industrial Property or one Member of the World Trade Organization for which that earlier application was filed (Rule 4J0(b)(ii)J: . . 



Box No. Vn INTERNATIONAJL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) (if two or more International Searching Authorities are competent to carry out the 
international search, indicate the Authority chosen; the two-letter code may be used): 

ISA/.EP 

Request to use results of earlier search; reference to that search (if an earlier search has been carried out by or requested from the 
International Searching Authority): 

Date (day/month/year) Number Country (or regional Office) 



BoxNo-Vni DECLARATIONS 



The foilowing declarations are contained in Boxes Nos. VIII (i) to (v) (mark the applicable 
check-boxes below and indicate in the right column the number of each type of declaration)'. 



Number of 
declarations 



□ Box No. VlII (i) 
Q Box No. Vm (ii) 

□ Box No. VIII (iii) 
Q Box No. vm (iv) 

□ Box No. VIII (v) 



Declaration as to the identity of the inventor 

Declaration as to the applicant's entitlement, as at the international filing 
date, to apply for and be granted a patent 

Declaration as to the applicant's entitlement, as at the international filing 
date, to claim the priority of the earlier application 

Declaration of inventorship (only for the purposes of the designation of the 
United States of America) 

Declaration as to non-prejudicial disclosures or exceptions to lack of novelty 
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Box No. IX CHECK LIST; LANGUAGE OF FILING 



This intemational application contains: 

(a) the following number of 
sheets in paper form: 

request (including 

declaration sheets) : 6 

description (excluding ^ C^r\^^ 

sequence listing part) tIH CioS 

claims : 6 

abstract : 1 

drawings ^ : 5 

Sub-total number of sheets \ \ ^^26J 

sequence listing pan of 

description (actual number 

of sheets if filed in paper 
form, whether or not also 
filed in computer readable 
form; see Q)) below) 



126 



Total number of sheets 

j^^^^ ^(b) sequence listing part of description filed in 
computer readable form 

(i) □ only (under Section 801 (a)(i)) 

(ii) O in addition to being filed in paper 
form (under Section 801(a)(ii)) 

Type and number of carriers (diskette, 
CD-ROM, CD-R or other) on which the 
sequence listing part is contained (additional 
copies to be indicated under item 9(ii), in 
right column): 



Figure of the drawings which 
should accompany the abstract: 



This intemational application is accompanied by the following 
item(s) (mark the applicable check-boxes below and indicate in 
right column the number of each item): 

1 . IS fee calculation sheet 

2. □ original separate power of attorney 

3. □ original general power of attorney 

4. □ copy of general power of attorney; reference number, * 

if any: . 

5. □ statement explaining lack of signature 

6. □ priority document(s) identified in Box No. VI as 

item(s): ' 

7. []]] translation of intemational application into 

(language): ; 

8. □ separate indications concerning deposited microorganism 

or other biological material 

9. □ sequence listing in computer readable fonn (indicate also type 

and number of carriers (diskette, CD-ROM, CD-R or other )) 
(i) □ copy submitted for the purposes of intemational search 
under Rule 1 3ter only (and not as part of the 
intemational application) ; 

(ii) □ (only where check-box (b)(i) or (b)(ii) is marked in left 
column) additional copies including, where applicable, 
the copy for the purposes of intemational search under 
RulelBter : 

(iii) □ together with relevant statement as to the identity 
of the copy or copies with the sequence listing part 
mentioned in left colxunn 



Number 
of items 



10. □ oihor (specifi/): 



Language of filing of the 
intemational appHcation: 



English 



Box No. X SIGNATURE OF APPLICANT, AGENT OR COMMON REPRESENTATIVE 

Next to each signature, indiccue the name of the person signing and the capacity in which the person signs (if such capacity is not obvUms from reading the request). 




KHOO, Chong-Yee 



. For receiving Office use only , 



1 . Date of actual receipt of the purported 
international application: 



•1i MAY 



12 



3. Corrected date of actual receipt due to later but 
timely received papers or drawings completing 
the purported intemational application: 



4- Date of timely receipt of the required 
corrections xmder PCT Article ll(2): 



5. Intemational Searching Authority 

(if two or more are competent): ISA / 



6. 



□ 



Transmittal of search copy delayed 
imtil search fee is paid 



2. Drawings: 
4^^^ received: 

I I not received: 



For International Bureau use only , 



Date of receipt of the record copy 
by the Intemational Bureau: 
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1 

RECEPTOR 



Field 



This invention relates to newly identified nucleic acids, polypeptides encoded by 
5 them and to their production and use. More particularly, the nucleic acids and polypeptides 
of the present invention relate to a G-protein coupled receptor (GPCR), hereinafter 
referred to as "Mowgli GPCR", and members of the purinoceptor family of GPCRs. The 
invention also relates to inhibiting or activating the action of such nucleic acids and 
polypeptides. 

10 Background 

It is w^ell established that many medically significant biological processes are 
mediated by proteins participating in signal transduction pathways that involve G-proteins 
and/or second messengers, for example, cAMP (Lefkowitz, Nature, 1991 , 35 1 : 353-354). 
These proteins are referred to as proteins participating in pathways with G-proteins or 

15 "PPG proteins". Some examples of these proteins include the GPC receptors, such as those 
for adrenergic agents and dopamine (Kobilka, B. K., et al., Proc. Natl Acad, ScL, USA, 
1987, 84: 46-50; Kobilka B. K., et al.. Science, 1987, 238: 650-656; Bunzow, J. R., et al.. 
Nature, 1988, 336: 783-787), G-proteins themselves, effector proteins, for example, 
phospholipase C, adenyl cyclase, and phosphodiesterase, and actuator proteins, for 

20 example, protein kinase A and protein kinase C (Simon, M, I., et al.. Science, 1991, 252: 
802-8). 

For example, in one form of signal transduction, the effect of hormone binding is 
activation of the enzyme adenylate cyclase inside the cell. Enzyme activation by hormones 
is dependent on the presence of the nucleotide, GTP. GTP also influences hormone 
25 binding. A G-protein connects the hormone receptor to adenylate cyclase. G-protein is 

shown to exchange GTP for bound GDP when activated by a hormone receptor. The GTP 
carrying form then binds to activated adenylate cyclase. Hydrolysis of GTP to GDP, 
catalysed by the G-protein itself, returns the G-protein to its basal, inactive form. Thus, the 



G-protein serves a dual role, as an intermediate that relays the signal from receptor to 
effector, and as a clock that controls the duration of the signal. 

The membrane protein gene superfamily of G-protein coupled receptors (GPCRs) 
has been characterised as having seven putative transmembrane, domains. The domains are 
believed to represent transmembrane a-helices connected by extracellular or cytoplasmic 
loops. G-protein coupled receptors include a wide range of biologically active receptors, 
such as hormone, viral, growth factor and neuroreceptors. 

G-protein coupled receptors (also known as 7TM receptors) have been 
characterised as including these seven conserved hydrophobic stretches of about 20 to 30 
amino acids, connecting at least eight divergent hydrophilic loops. The G-protein family 
of coupled receptors includes dopamine receptors which bind to neuroleptic drugs used for 
treating psychotic and neurological disorders. Other examples of members of this family 
include, but are not limited to, calcitonin, adrenergic, endothelin, cAMP, adenosine, 
muscarinic, acetylcholine, serotonin, histamine, thrombin, kinin, follicle stimulating 
hormone, opsins, endothelial differentiation gene-1, rhodopsins, odorant, and 
cytomegalovirus receptors. 

Most G-protein coupled receptors have single conserved cysteine residues in each 
of the first two extracellular loops which form disulphide bonds that are believed to 
stabilise functional protein structure. The 7 transmembrane regions are designated as TMl, 
TM2, TM3, TM4, TM5, TM6, and TM7. TM3 has been implicated in signal transduction. 

Phosphorylation and lipidation (pamitylation or famesylation) of cysteine residues 
can influence signal transduction of some G-protein coupled receptors. Most G-protein 
coupled receptors contain potential phosphorylation sites wdthin the third cytoplasmic loop 
and/or the carboxy terminus. For several G-protein coupled receptors, such as the p - 
adrenoreceptor, phosphorylation by protein kinase A and/or specific receptor kinases 
mediates receptor desensitization. For some receptors, the ligand binding sites of G- 
protein coupled receptors are beUeved to comprise hydrophilic sockets formed by several 
G-protein coupled receptor transmembrane domains, the sockets being surrounded by 



hydrophobic residues of the G-protein coupled receptors. The hydrophiUc side of each G- 
protein coupled receptor transmembrane helix is thought to face inward and form a polar 
ligand binding site. TM3 has been implicated in several G-protein coupled receptors as 
having a ligand binding site, such as the TM3 aspartate residue. TM5 serines, a TM6 
asparagine and TM6 or TM7 phenylalanines or tyrosines are also implicated in ligand 
binding. 

G-protein coupled receptors can be intracellularly coupled by heterotrimeric G- 
proteins to various intracellular enzymes, ion channels and transporters (see, Johnson et 
aL, Endoc, Rev., 1989, 10: 317-331). Different G-protein a-subunits preferentially 
stimulate particular effectors to modulate various biological functions in a cell. 
Phosphorylation of cytoplasmic residues of G-protein coupled receptors has been 
identified as an important mechanism for the regulation of G-protein coupling of some G- 
protein coupled receptors. G-protein coupled receptors are found in numerous sites within 
a mammalian host. Over the past 15 years, nearly 350 therapeutic agents targeting 7 
transmembrane (7 TM) receptors have been successfully introduced onto the market. 

Thus, G-protein coupled receptors have an established, proven history as 
therapeutic targets. Clearly there is a need for identification and characterization of further 
receptors which can play a role in preventing, ameliorating or correcting dysfunctions or 
diseases, including, but not limited to, infections such as bacterial, fungal, protozoan and 
viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; diabetes, 
obesity; anorexia; bulimia; asthma; Parkinson's disease; thrombosis; acute heart failure; 
hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflanmiatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such as Himtington's disease or Gilles dela Tourett's 
syndrome. 
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Summary of the Invention 

According to a first aspect of the present invention, we provide a Mowgli GPCR 
polypeptide comprising the amino acid sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant or derivative thereof. 

There is provided, according to a second aspect of the present invention, a nucleic 
acid capable of encoding a polypeptide according to the first aspect of the invention. 
Preferably, the nucleic acid comprises the nucleic acid sequence shown in SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a homologue, 
variant or derivative thereof. 

10 We provide, according to a third aspect of the present invention, a polypeptide 

comprising a fragment of a polypeptide according to the first aspect of the invention. 

Preferably, such a fragment containing polypeptide comprises one or more regions 
which are homologous between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ 

ID NO: 6 and SEQ ID NO: 8, or which comprises one or more regions which are 
heterologous between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 
and SEQ ID NO: 8. 

As a fourth aspect of the present invention, there is provided a nucleic acid capable 
of encoding a polypeptide according to the third aspect of the invention. 

We provide, according to a fifth aspect of the present invention, a vector 
20 comprising a nucleic acid according to the second or fourth aspect of the invention. 

The present invention, in a sixth aspect, provides a host cell comprising a nucleic 
acid according to the second or fourth aspect of the invention, or vector according to the 
fifth aspect of the invention. 





In a seventh aspect of the present invention, there is provided a transgenic non- 
human animal comprising a nucleic acid according to the second or fourth aspect of the 
invention or a vector according to the fifth aspect of the invention. Preferably, the 
transgenic non-human animal is a mouse. 

According to an eighth aspect of the present invention, we provide use of a 
polypeptide according to the first or third aspect of the invention in a method of 
identifying compound which is capable of interacting specifically with a G protein 
coupled receptor. 

We provide, according to a ninth aspect of the invention, use of a transgenic non- 
human animal according to the seventh aspect of the invention in a method of identifying a 
compound which is capable of interacting specifically with a G protein coupled receptor. 

There is provided, in accordance with a tenth aspect of the present invention, a 
method for identifying an antagonist of a Mowgli GPCR, the method comprising 
contacting a cell which expresses Mowgli receptor with a candidate compound and 
determining whether the level of cyclic AMP (cAMP) in said cell is lowered as a result of 
said contacting. 

As an eleventh aspect of the invention, we provide a method for identifying a 
compound capable of lowering the endogenous level of cyclic AMP in a cell which ^ 
method comprises contacting a cell which expresses a Mowgli GPCR with a candidate 
compound and determining whether the level of cyclic AMP (cAMP) in said cell is 
lowered as a result of said contacting. 

According to a twelfth aspect of the invention, we provide a method for identifying 
a compound capable of binding to a Mowgli GPCR polypeptide, the method comprising 
contacting a Mowgli GPCR polypeptide with a candidate compound and determining 
whether the candidate compound binds to the Mowgli GPCR polypeptide. 



We provide, according to a thirteenth aspect of the invention, there is provided a 
compound identified by a method according to any of the eighth to twelfth aspects of the 
invention. 

According to a fourteenth aspect of the present invention, we provide a compound 
capable of binding specifically to a polypeptide according to the first or third aspect of the 
invention. 

There is provided, according to a fifteenth aspect of the present invention, use of a 
polypeptide according to the first or third aspect of the invention, or part thereof; or a 
nucleic acid according to the second or fourth aspect of the invention, or part thereof, in a 
method for producing antibodies. 

We provide, according to a sixteenth aspect of the present invention, an antibody 
capable of binding specifically to a polypeptide according to the first or third aspect of the 
invention, or part thereof; or a polypeptide encoded by a nucleic acid according to the 
second or fourth aspect of the invention, or part thereof;. 

As a seventeenth aspect of the present invention, there is provided a 
pharmaceutical composition comprising any one or more of the following: a polypeptide 
according to the first or third aspect of the invention, or part thereof; a polypeptide 
encoded by a nucleic acid according to the second or fourth aspect of the invention, or part 
thereof; a vector according to the fifth aspect of the invention; a cell according to the sixth 
aspect of the invention; a compound according to the thirteenth or fourteenth aspect of the 
invention; and an antibody according to the sixteenth aspect of the invention, together with 
a pharmaceutically acceptable carrier or diluent. 

We provide, according to a eighteenth aspect of the present invention, a vaccine 
composition comprising any one or more of the following: a polypeptide according to the 
first or third aspect of the invention, or part thereof; a polypeptide encoded by a nucleic 
acid according to the second or fourth aspect of the invention, or part thereof; a vector 
according to the fifth aspect of the invention; a cell according to the sixth aspect of the 



invention; a compound according to the thirteenth or fourteenth aspect of the invention; 
and an antibody according to the sixteenth aspect of the invention. 

According to an nineteenth aspect of the present invention, we provide a diagnostic 
kit for a disease or susceptibihty to a disease comprising any one or more of the following: 
a polypeptide according to the first or third aspect of the invention, or part thereof; a 
polypeptide encoded by a nucleic acid according to the second or fourth aspect of the 
invention, or part thereof; a vector according to the fifth aspect of the invention; a cell 
according to the sixth aspect of the invention; a compound according to the thirteenth or 
fourteenth aspect of the invention; and an antibody according to the sixteenth aspect of the 
invention. 

We provide, according to a twentieth aspect of the invention, a method of treating 
a patient suffering from a disease associated with enhanced activity of a MowgH GPCR, 
which method comprises administering to the patient an antagonist of Mowgli GPCR. 

There is provided, in accordance with a twenty-first aspect of the present 
invention, a method of treating a patient suffering from a disease associated with reduced 
activity of a Mowgli GPCR, which method comprises administering to the patient an 
agonist of Mowgli GPCR. 

Preferably, the Mowgli GPCR comprises a polypeptide having the sequence shown 
in SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, 
variant or derivative thereof, preferably SEQ ID NO: 3 or SEQ ID NO: 5. 

According to a twenty-second aspect of the present invention, we provide a method 
for treating and/or preventing a disease in a patient, which comprises the step of 
administering any one or more of the following to the patient: a polypeptide according to 
the first or third aspect of the invention, or part thereof; a polypeptide encoded by a 
nucleic acid according to the second or fourth aspect of the invention, or part thereof; a 
vector according to the fifth aspect of the invention; a cell according to the sixth aspect of 
the invention; a compound according to the thirteenth or fourteenth aspect of the 
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invention; and an antibody according to the sixteenth aspect of the invention; a 
pharmaceutical composition according to the seventeenth aspect of the invention; and a 
vaccine according to the eighteenth aspect of the invention, to the subject. 

There is provided, according to a tvvrenty-third aspect of the present invention, an 
agent comprising a polypeptide according to the first or third aspect of the invention, or 
part thereof; a polypeptide encoded by a nucleic acid according to the second or fourth 
aspect of the invention, or part thereof; a vector according to the fifth aspect of the 
invention; a cell according to the sixth aspect of the invention; a compound according to 
the thirteenth or fourteenth aspect of the invention; and an antibody according to the 
sixteenth aspect of the invention, said agent for use in a method of treatment or 
prophylaxis of disease. 

We provide, according to a tw^enty-fourth aspect of the present invention, use of a 
polypeptide according to the first or third aspect of the invention, or part thereof; a . 
polypeptide encoded by a nucleic acid according to the second or fourth aspect of the 
invention, or part thereof; a vector according to the fifth aspect of the invention; a cell 
according to the sixth aspect of the invention; a compound according to the thirteenth or 
fourteenth aspect of the invention; and an antibody according to the sixteenth aspect of the 
invention, for the preparation of a pharmaceutical composition for the treatment or 
prophylaxis of a disease. 

As a twenty-fifth aspect of the present invention, there is provided non-human 
transgenic animal, characterized in that the transgenic animal comprises an altered Mowgli 
gene. Preferably, the alteration is selected firom the group consisting of:, a deletion of 
Mowgli, a mutation in Mowgli resulting in loss of fimction, introduction of an exogenous 
gene having a nucleotide sequence with targeted or random mutations into Mowgli, 
introduction of an exogenous gene from another species into Mowgli, and a combination 
of any of these. 

We provide, according to a twenty-sixth aspect of the present invention, a non- 
human transgenic animal having a functionally disrupted endogenous Mowgli gene, in 



which the transgenic animal comprises in its genome and expresses a transgene encoding a 
heterologous Mowgli protein. 

The present invention, in a twenty-seventh aspect, provides a nucleic acid construct 
for functionally disrupting a Mowgli gene in a host cell, the nucleic acid construct 
comprising: (a) a non-homologous replacement portion; (b) a first homology region 
located upstream of the non-homologous replacement portion, the first homology region 
having a nucleotide sequence with substantial identity to a first Mowgli gene sequence; 
and (c) a second homology region located downstream of the non-homologous 
replacement portion, the second homology region having a nucleotide sequence with 
substantial identity to a second Mowgli gene sequence, the second Mowgli gene sequence 
having a location downstream of the first Mowgli gene sequence in a naturally occurring 
endogenous Mowgli gene. 

According to a twenty-eighth aspect of the present invention, we provide a process 
for producing a Mowgli GPCR polypeptide, the method comprising culturing a host cell 
according to the sixth aspect of the invention under conditions in which a nucleic . acid 
encoding a Mowgli GPCR polypeptide is expressed. 

There is provided, according to a twenty-ninth aspect of the present invention, a 
method of detecting the presence of a nucleic acid according to the second or fourth aspect 
of the invention in a sample, the method comprising contacting the sample with at least 
one nucleic acid probe which is specific for said nucleic acid and monitoring said sample 
for the presence of the nucleic acid. 

We provide, according to a thirtieth aspect of the present invention, a method of 
detecting the presence of a polypeptide according to the first or third aspect of the 
invention in a sample, the method comprising contacting the sample with an antibody 
according to the sixteenth aspect of the invention and monitoring said sample for the 
presence of the polypeptide. 
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As a thirty-first aspect of the present invention, there is provided a method of 
diagnosis of a disease or syndrome caused by or associated with increased, decreased or 
othenvise abnomial expression of Mowgli GPCR, the method comprising the steps of: (a) 
detecting the level or pattem of expression of Mowgli GPCR in an animal suffering or 
suspected to be suffering from such a disease; and (b) comparing the level or pattem of 
expression with that of a noraial animal. 

Preferably, the disease is selected from the group consisting of: platelet ADP 
receptor defect; autosomal recessive deafness; myotonic dystrophy; essential 
hypertension; Usher syndrome, type 3; juvenile-onset cataract; congenital cataract; and 3- 
methylcrotonylglycinuria 1. 

Brief Description of the Drawings 

Figure 1 is a diagram showing the results of analysis of the human Mowgli 
polypeptide (SEQ ID NO: 3) using the HMM structural prediction software of pfam 
(http ://www. Sanger, ac . uk/Soft ware/Pfam/search. shtml) . 

Figure 2 is a diagram showing an expression profile for human Mowgli GPCR 
generated by reverse transcription-polymerase chain reaction (RT-PCR). 

Figure 3 is a diagram showing the structure of the mouse Mowgli locus before 
knock-out. 

Figure 4 is a diagram showing the structure of the mouse Mowgli locus after 
knock-out. 

Figure 5 is a diagram showing the structure of the vector used for mouse Mowgli 
knock-out, including cloning sites. 



Sequence Listings 
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SEQ ID NO: 1 shows the cDNA sequence of human Mowgli. SEQ ID NO: 2 
shows an open reading frame derived from SEQ ID NO: 1. SEQ ID NO: 3 shows the 
amino acid sequence of human Mowgli. SEQ ID NO: 4 shows the open reading frame of 
a cDNA for Mouse Mowgli. SEQ ID NO: 5 shows the amino acid sequence of Mouse 
Mowgli. 

SEQ ID NO: 6 shows the amino acid sequence of a fusion protein of Mowgli with 
V5 and His tags at c-terminal end. SEQ ID NO: 7 shows a nucleotide sequence 
comprising Mowgli with Nhel and Hindlll at the 5-prime and 3-prime ends, generated by 
PGR. SEQ ID NO: 8 shows the amino acid sequence of Mowgli fusion protein 
comprismg a C terminal FLAG tag. SEQ ID NO: 10 shows the nucleotide sequence of a 
Mowgli genomic locus from 5'prF to 3'prR, including translated coding amino acid 
sequence. 

Where two sequences are set out under a particular sequence identifier (i.e., SEQ 
ID NO: NN), both sequences should understood to be encompassed by reference to that 
particular SEQ ID NO:. 

Detailed Description 

Mowgli GPCR 

Our invention relates in general to a novel G-Protein Coupled Receptor (GPCR), in 
particular, an orphan purinoceptor type G-protein coupled receptor, which we refer to as 
Mowgli GPCR, as well as homologues, variants or derivatives thereof. 

Mowgli is structurally related to other proteins of the G-protein coupled receptor 
family, as shown by the results of sequencing the amplified cDNA products encoding 
human Mowgli. The cDNA sequence of SEQ ID NO: 1 contains an open reading frame 
(SEQ ID NO: 2, nucleotide numbers 326 to 1400) encoding a polypeptide of 358 amino 



acids shown in SEQ ID NO: 3. Human Mowgli is found to map to Homo sapiens 
chromosome 3 q2 1 -q25 . 



Identities and Similarities to Mowgli 

The amino acid sequence of Mowgli has about 47 % identity and 67 % similarity 
5 (using BLAST) in 309 amino acid residues with human UDP-glucose nucleoside GPCR 
KiaaOOOl (Accession # D13626 Nomura,N,direct Submission 03-FEB-1999 related 
accession number: NP_055694). Mowgli amino acid sequence is 100% homologous to a 
computer prediction of GPR87 (Accession # AF237763_1 Wittenberger,T., Schaller,H.C. 
and Hellebrand,S."An expressed sequence tag (est) data mining strategy succeeding in the 
10 discovery of new g-protein coupled receptors. J. Mol. Biol. 307 (3), 799-813 (2001)). 

The nucleotide sequence of Mowgli (SEQ ID NO: 1) has about 99% identity (using 
BLAST) in 697 nucleotide residues with the anonymous Homo sapiens EST transitional 
cell papilloma cell line bladder cDNA from (Accession # BG286447 Robert Strausberg, 
NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap, 20-Feb-2001). Fiirthermore, Mowgli 

15 (SEQ IDNo: 1) is about 97% identical in 745 nucleotide residues to the anonymous Homo 
sapiens EST pancreatic adenocarcinoma cDNA from (Accession # BE56361 1 Robert 
Strausberg, NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap, Aug 10 2000). MowgU 
(SEQ ID NO: 1) is also about 98% identical to: the anonymous Homo sapiens ESTs: 
Accession # BF028445, Accession # BE879604; Accession # BG677820; Accession 

20 #AW998526; the ESTs described as similar to SW:KI01_HUMAN Q15391 PROBABLE 
G PROTEIN-COUPLED RECEPTOR KIAAOOOl (Accession # AI659657, Accession # 
W79920, Accession # AI028704, Accession # AW873727, Accession # AI743546, 
Accession # AI659657); the EST described as similar to SW:P2UR__RAT P41232 P2U 
PURINOCEPTOR (Accession # W79123). 

25 Analysis of the Mowgli polypeptide (SEQ ID NO: 3) using the HMM structural 

prediction software of pfam rhttp://wAv-w.sauger>ac.uk/SQftware/Pfam/search.shtmn 
confirms that Mowgli peptide is a GPCR of the 7TM-1 structural class (see Figure 1). 



VI 
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Hiiman and mouse Mowgli GPCR are therefore members of a large family of G 
Protein Coupled Receptors (GPCRs). 

Expression Profile of Mowgli 

Polymerase chain reaction (PGR) amplification of Mowgli cDNA detects 
expression of Mowgli to varying abundance in human leukocytes, lung, prostate, ovary, 
and testis.. An expression profile of Mowgli GPCR is shown in Figure 2. Using Mowgli 
cDNA of SEQ ID NO: 1 to search the human EST data sources by BLASTN, identities are 
found in cDNA derived from libraries originating from adenocarcinoma from pancreas 
(Accession # BE56361 1), undifferentiated large cell carcinoma from lung (Accession # 
BE879604), carcinoma cell line firom bladder (Accession # BF028445), adult normal 
breast tissue (Accession #AW998256), squamous cell carcinoma from skin (Accession 
#BG677820), fetal heart (Accession # W79920 and Accession # W79123), testis 
(Accession # AI028704), neuroblastoma from adrenal gland (Accession # AW873727), 
prostate (Accession # AI659657), pooled: fetal lung , testis, and B-cell (Accession # 
AI743546) and transitional cell papilloma cell line firom bladder (Accession # BG286447). 
This indicates that Mowgli is expressed in these normal or abnormal tissues. Accordingly, 
the Mowgli polypeptides, nucleic acids, probes, antibodies, expression vectors and ligands 
are useful for detection, diagnosis, treatment and other assays for diseases associated with 
over-, under- and abnormal expression of Mowgli GPCR in these and other tissues. 

Furthermore, the Mowgli polypeptides, nucleic acids, probes, antibodies, 
expression vectors and ligands are useful as (and for the production of) biosensors. 
According to Aizawa (1988), Anal, Chem. Symp. 17: 683, a biosensor is defined as being a 
imique combination of a receptor for molecular recognition, for example a selective layer 
with immobilized antibodies or receptors such as a Movvgli G-protein coupled receptor, 
and a transducer for transmitting the values measured. One group of such biosensors will 
detect the change which is caused in the optical properties of a surface layer due to the 
interaction of the receptor with the surrounding medium. Among such techniques may be 
mentioned especially ellipso-metry and surface plasmon resonance. Biosensors 
incorporating Mowgli may be used to detect the presence or level of Mowgli ligands, for 
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example, nucleotides such as purines or purine einaiogues, or analogues of these ligands. 
The construction of such biosensors is well known in the art. Thus, cell lines expressing 
Mowgli receptor may be used as reporter systems for detection of ligands such as ATP via 
receptor-promoted formation of [3H]inositol phosphates or other second messengers (Watt 
5 et al., 1998, J Biol Chem May 29;273(22): 14053-8). Receptor-Hgand biosensors are also 
described in Hoffman Qi ?lU 2000, Proc Natl Acad Sci USA Oct 10;97(21):11215-20. 
Optical and other biosensors comprising Mowgli may also be used to detect the level or 
presence of interaction with G-proteins and other proteins, as described by, for example, 
Figler et al, 1997, Biochemistry Dec 23;36(51):16288-99 and Sarrio et aL, 2000, Mol Cell 
^ 10 Biol 2000 Jul;20(14):5 164-74). Sensor units for biosensors are described in, for example, 
US 5,492,840. 



This and other embodiments of the invention will be described in further detail 

below. 



Methods Employed 

15 The practice of the present invention will employ, unless otherwise indicated, 

conventional techniques of chemistry, molecular biology, microbiology, recombinant 
DNA and immunology, which are within the capabilities of a person of ordinary skill in - 
the art. Such techniques are explained in the literature. See, for example, J. Sambrook, E. 
F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A Laboratory Manual, Second 

20 Edition, Books 1-3, Cold Spring Harbor Laboratory Press; Ausubel, F. M. et al. (1995 and 
periodic supplements; Current Protocols in Molecular Biology, ch. 9, 13, and 16, John 
Wiley & Sons, New York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 1996, DNA Isolation 
and Sequencing: Essential Techniques, John Wiley & Sons; J. M. Polak and James O'D. 
McGee, 1990, In Situ Hybridization: Principles and Practical Oxford University Press; M. 

25 J. Gait (Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, W Press; and, D. 
M. J. Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA Structure Part A: 
Synthesis and Physical Analysis of DNA Methods in Enzymology, Academic Press. Each 
of these general texts is herein incorporated by reference. 
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MowGLi GPCR Polypeptides 

As used here, the term "Mowgli GPCR polypeptide" is intended to refer to a 
polypeptide comprising the amino acid sequence shown in any of SEQ ID NO: 3, SEQ ID 
NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant or derivative thereof. 
Preferably, the polypeptide comprises or is a homologue, variant or derivative of the 
sequence shown in SEQ ID NO: 3. 

"Polypeptide" refers to any peptide or protein comprising two or more amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. 
"Polypeptide" refers to both short chains, commonly referred to as peptides, oligopeptides 
or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may 
contain amino acids other than the 20 gene-encoded amino acids. 

"Polypeptides" include amino acid sequences modified either by natural processes, 
such as post-translational processing, or by chemical modification techniques which are 
well known in the art. Such modifications are well described in basic texts aiid in more 
detailed monographs, as well as in a voluminous research literature. Modifications can 
occur anywhere in a polypeptide, including the peptide backbone, the amino acid side- 
chains and the amino or carboxyl termini. It will be appreciated that the same type of 
modification may be present in the same or varying degrees at several sites in a given 
polypeptide. Also, a given polypeptide may contain many types of modifications. 

Polypeptides may be branched as a resuh of ubiquitination, and they may be cyclic, 
with or without branching. Cyclic, branched and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-inking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-inks, formation of cystine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
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anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, 
sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, 
and ubiquitination. See, for instance. Proteins - Structure and Molecular Properties, 2nd 
5 Ed,, T. E. Creighton, W. H. Freeman and Company, New York, 1993 and Wold, F., 
Posttranslational Protein Modifications: Perspectives and Prospects, pgs. 1-12 in 
Posttranslational Covalent Modification of Proteins, B. C. Johnson, Ed., Academic Press, 
New York, 1983; Seifter et al, "Analysis for protein modifications and nonprotein 
cofactors", Meth Enzymol (1990) 182:626-646 and Rattan et aL, "Protein Synthesis: 
I 1 0 Posttranslational Modifications and Aging", Ann NY Acad Sci (1 992) 663 :48-62. 



The terms "variant", "homologue", "derivative" or "fragment" in relation to the 
present invention include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) amino acid from or to a sequence. Unless the 
context admits otherwise, references to "Mowgii" and "Mowgli GPCR" include references 
15 to such variants, homologues, derivatives and fragments of Mowgli. 

Preferably, as applied to Mowgli, the resultant amino acid sequence has GPCR 
activity, more preferably having at least the same activity of the Mowgli GPCR shown as 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8. In particular, the term 
"homologue" covers identity with respect to structure and/or function providing the 

20 resultant amino acid sequence has GPCR activity. With respect to sequence identity (i.e. 
similarity), preferably there is at least 70%, more preferably at least 75%, more preferably 
at least 85%, even more preferably at least 90% sequence identity. More preferably there 
is at least 95%, more preferably at least 98%, sequence identity. These terms also 
encompass polypeptides derived from amino acids which are allelic variations of the 

25 Mowgli GPCR nucleic acid sequence. 

Where reference is made to the "receptor activity" or "biological activity" of a 
receptor such as Mowgli GPCR, these terms are intended to refer to the metabolic or 
physiological function of the Mowgli receptor, including similar activities or improved 
activities or these activities with decreased undesirable side effects. Also included are 
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antigenic and immunogenic activities of the Mowgli receptor. Examples of GPCR activity, 
and methods of assaying and quantifying these activities, are knovm in the art, and are 
described in detail elsewhere in this document. 

As used herein a "deletion" is defined as a change in either nucleotide or amino 
acid sequence in v^hich one or more nucleotides or amino acid residues, respectively, are 
absent. As used herein an "insertion" or "addition" is that change in a nucleotide or amino 
acid sequence which has resulted in the addition of one or more nucleotides or amino acid 
residues, respectively, as compared to the naturally occurring substance. As used herein 
"substitution" results from the replacement of one or more nucleotides or amino acids by 
different nucleotides or amino acids, respectively. 

Mowgli polypeptides according to the present invention may also have deletions, 
insertions or substitutions of amino acid residues which produce a silent change and result 
in a functionally equivalent amino acid sequence. Deliberate amino acid substitutions may 
be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively charged amino 
acids include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, 
asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to the table below. 
Amino acids in the same block in the second column and preferably in the same line in the 
third column may be substituted for each other: 
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ALIPHATIC 


Non-polar 


GAP 






IL V 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HF W Y 



Movvgli polypeptides of the invention may further comprise heterologous amino 
acid sequences, typically at the N-terminus or C-terminus, preferably the N-terminus. 
Heterologous sequences may include sequences that affect intra or extracellular protein 
targeting (such as leader sequences). Heterologous sequences may also include sequences 
that increase the immunogenicity of the polypeptide of the invention and/or which 
facilitate identification, extraction and/or purification of the polypeptides. Another 
heterologous sequence that is particularly preferred is a polyamino acid sequence such as 
polyhistidine which is preferably N-terminal. A polyhistidine sequence of at least 10 
amino acids, preferably at least 1 7 amino acids but fewer than 50 amino acids is especially 
preferred. 

The Mowgli GPCR polypeptides may be in the form of the "mature" protein or 
may be a part of a larger protein such as a fusion protein. It is often advantageous to 
include an additional amino acid sequence which contains secretory or leader sequences, 
pro-sequences, sequences which aid in purification such as multiple histidine residues, or 
an additional sequence for stability during recombinant production. 

Mowgli polypeptides of the invention are advantageously made by recombinant 
means, using known techniques. However they may also be made by synthetic means 
using techniques well known to skilled persons such as solid phase synthesis. Polypeptides 
of the invention may also be produced as fusion proteins, for example to aid in extraction 
and purification. Examples of fusion protein partners include glutathione-S-transferase 
(GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation domains) and (3- 
galactosidase. It may also be convenient to include a proteolytic cleavage site between the 
fusion protein partner and the protein sequence of interest to allow removal of fusion 
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protein sequences, such as a thrombin cleavage site. Preferably the fusion protein will not 
hinder the function of the protein of interest sequence. 

Mowgli polypeptides of the invention may be in a substantially isolated form. This 
term is intended to refer to alteration by the hand of man from the natural state. If an 
"isolated" composition or substance occurs in nature, it has been changed or removed 
from its original environment, or both. For example, a polynucleotide, nucleic acid or a 
polypeptide naturally present in a living animal is not "isolated," but the same 
polynucleotide, nucleic acid or polypeptide separated from the coexisting materials of its 
natural state is "isolated", as the term is employed herein. 

It will however be understood that the Mowgli GPCR protein may be mixed with 
carriers or diluents which will not interfere with the intended purpose of the protein and 
still be regarded as substantially isolated. A polypeptide of the invention may also be in a 
substantially purified form, in which case it will generally comprise the protein in a 

preparation in which more than 90%, for example, 95%, 98% or 99% of the protein in the 
preparation is a Mowgli GPCR polypeptide of the invention. 

The present invention also relates to peptides comprising a portion of a Mowgli 
polypeptide according to the invention. Thus, fragments of Mowgli GPCR and its 
homologues, variants or derivatives are included. The peptides of the present invention 
may be between 2 and 200 amino acids, preferably between 4 and 40 amino acids in 
length. The peptide may be derived from a Mowgli GPCR polypeptide as disclosed here, 
for example by digestion with a suitable enzyme, such as trypsin. Altematively the 
peptide, fragment, etc may be made by recombinant means, or synthesised synthetically. 

The term "peptide" includes the various synthetic peptide variations known in the 

art, such as a retroinverso D peptides. The peptide may be an antigenic determinant and/or 
a T-cell epitope. The peptide may be immunogenic in vivo. Preferably the peptide is 
capable of inducing neutralising antibodies in vivo. 
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By aligning Mowgli GPCR sequences from different species, it is possible to 
determine which regions of the amino acid sequence are conserved between different 
species ("homologous regions"), and which regions vary between the different species 
("heterologous regions"). 



5 The Mowgli polypeptides according to the invention may therefore comprise a 

sequence which corresponds to at least part of a homologous region. A homologous region 
shows a high degree of homology between at least two species. For example, the 
homologous region may show at least 70%, preferably at least 80%, more preferably at 
least 90%, even more preferably at least 95% identity at the amino acid level using the 
10 tests described above. Peptides which comprise a sequence which corresponds to a 
homologous region may be used in therapeutic strategies as explained in further detail 
below. Altematively, the Mowgli GPCR peptide may comprise a sequence which 
corresponds to at least part of a heterologous region. A heterologous region shows a low 
degree of homology between at least two species. 



1 5 Mowgli GPCR Polynucleotides and Nucleic Acids 



This invention encompasses Mowgli polynucleotides, Mowgli nucleotides and 
Mowgli nucleic acids, methods of production, uses of these, etc, as described in further 
detail elsewhere in this document. 



The terms "Mowgli polynucleotide", "MowgU nucleotide" and "Mowgli nucleic 
20 acid" may be used interchangeably, and are intended to refer to a polynucleotide/nucleic 
acid comprising a nucleic acid sequence as shown in SEQ ID NO: 1, SEQ ID NO: 2, SEQ 
ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a homologue, variant or derivative 
thereof. Preferably, the polynucleotide/nucleic acid comprises or is a homologue, variant 
or derivative of the nucleic acid sequence SEQ ID NO: 1 or SEQ ID NO: 2, most 
25 preferably, SEQ ID NO: 2. 

These terms are also intended to include a nucleic acid sequence capable of 
encoding a polypeptides and/or a peptide of the present invention, i.e., a Mowgli 
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polypeptide. Thus, Mowgli GPCR polynucleotides and nucleic acids comprise a 
nucleotide sequence capable of encoding a polypeptide comprising the amino acid 
sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or a 
homologue, variant or derivative thereof. Preferably, the Mowgli GPCR polynucleotides 
and nucleic acids comprise a nucleotide sequence capable of encoding a polypeptide 
comprising the amino acid sequence shown in SEQ ID NO: 3, or a homologue, variant or 
derivative thereof. 

"Polynucleotide" generally refers to any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or 
DNA. "Polynucleotides" include, without limitation single- and double-stranded DNA, 
DNA that is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 
comprising DNA and RNA that may be single-stranded or, more typically, double- 
stranded or a mixture of single- and double-stranded regions. In addition, "polynucleotide" 
refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The 
term polynucleotide also includes DNAs or RNAs containing one or more modified bases 
and DNAs or RNAs with backbones modified for stability or for other reasons. 
"Modified" bases include, for example, tritylated bases and unusual bases such as inosine. 
A variety of modifications has been made to DNA and RNA; thus, "polynucleotide" 
embraces chemically, enzymatically or metabolically modified forms of polynucleotides 
as typically found in nature, as well as the chemical forms of DNA and RNA characteristic 
of viruses and cells. "Polynucleotide" also embraces relatively short polynucleotides, often 
referred to as oligonucleotides. 

It will be understood by the skilled person that numerous nucleotide sequences can 
encode the same polypeptide as a result of the degeneracy of the genetic code. 

As used herein, the term "nucleotide sequence" refers to nucleotide sequences, 
oligonucleotide sequences, polynucleotide sequences and variants, homologues, fragments 
and derivatives thereof (such as portions thereof). The nucleotide sequence may be DNA or 
RNA of genomic or synthetic or recombinant origin which may be double-stranded or 
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single-stranded whether representing the sense or antisense strand or combinations thereof. 
The term nucleotide sequence may be prepared by use of recombinant DNA techniques 
(for example, recombinant DNA). 

Preferably, the term "nucleotide sequence" means DNA. 

The terms "variant", "homologue", "derivative" or "fragment" in relation to the 
present invention include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) nucleic acids from or to the sequence of a Mowgli 
nucleotide sequence. Unless the context admits otherwise, references to "Mowgli" and 
"Mowgli GPCR" include references to such variants, homologues, derivatives and 
fragments of Mowgli. 

Preferably, the resultant nucleotide sequence encodes a polypeptide having GPCR 
activity, preferably having at least the same activity of the GPCR shown as SEQ ID NO: 
3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8. Preferably, the term "homologue" is 
intended to cover identity with respect to structure and/or function such that the resultant 
nucleotide sequence encodes a polypeptide which has GPCR activity. With respect to 
sequence identity (i.e. similarity), preferably there is at least 70%, more preferably at least 
75%, more preferably at least 85%, more preferably at least 90% sequence identity. More 
preferably there is at least 95%, more preferably at least 98%, sequence identity. These 
terms also encompass allelic variations of the sequences. 

Mowgli Control Regions 

For some purposes, it may be necessary to utilise or investigate control regions of 
Mowgli. Such control regions include promoters, enhancers and locus control regions. By 
a control region we mean a nucleic acid sequence or structure which is capable of 
modulating the expression of a coding sequence which is operatively linked to it. 

For example, control regions are useful in generating transgenic animals 
expressing Mowgli. Furthermore, control regions may be used to generate expression 
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constructs for Mowgli. Control regions from different individuals in a population may be 
sequenced to identify non-coding polymorphisms, which may affect the expression level 
of Mowgli. This is described in further detail below. 

Identification of control regions of Mowgli is straightforward, and may be carried 
out in a number of ways. For example, the coding sequence of Mowgli may be obtained 
from an organism, by screening a cDNA library using a human or mouse Mowgli cDNA 
sequence as a probe. 5' sequences may be obtained by screening an appropriate genomic 
library, or by primer extension as known in the art. Database searching of genome 
databases may also be employed. Such 5' sequences which are particularly of interest 
include non-coding regions. The 5' regions may be examined by eye, or with the aid of 
computer programs, to identify sequence motifs which indicate the presence of promoter 
and/or enhancer regions. 

Furthermore, sequence alignments may be conducted of Mowgli nucleic acid 
sequences from two or more organisms. By aligning Mowgli GPCR sequences from 
different species, it is possible to determine which regions of the nucleotide sequence are 
conserved between different species. Such conserved regions are likely to contain control 
regions for the gene in question (i.e., Mowgli). The mouse and human genomic sequences 
upstream of the sequences disclosed here, may be employed for this purpose. Furthermore, 
Mowgli homologues from other organisms may be obtained using standard methods of 
screening using appropriate probes generated from the mouse and human Mowgli 
sequences. The genome of the pufferfish (Takifugu rubripes) may also be screened to 
identify a Mowgli homologue; comparison of the 5' non-coding region of the Fugu 
Mowgli gene with a mouse or human genomic Mowgli sequence (e.g., SEQ ID NO: 10, 
mouse Mowgli genomic sequence) may be used to identify conserved regions containing 
control regions. 

Deletion studies may also be conducted to identify promoter and/or enhancer 
regions for Mowgli. 



24 



The identity of putative control regions may be confirmed by molecular biology 
experiments, in which the candidate sequences are linked to a reporter gene and the 
expression of the reporter detected. 

Calculation of Sequence Homology 

Sequence identity with respect to any of the sequences presented here can be 
determined by a simple "eyeball" comparison (i.e. a strict comparison) of any one or more 
of the sequences with another sequence to see if that other sequence has, for example, at 
least 70% sequence identity to the sequence(s). 

Relative sequence identity can also be determined by commercially available 
computer programs that can calculate % identity between two or more sequences using 
any suitable algorithm for determining identity, using for example default parameters. A 
typical example of such a computer program is CLUSTAL. Other computer program 
methods to determine identify and similarity between the two sequences include but are 
not limited to the GCG program package (Devereux et al 1984 Nucleic Acids Research 
12: 387) and FASTA (Atschul et al 1990 J Molec Biol 403-410), 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly compared 
with the corresponding amino acid in the other sequence, one residue at a time. This is called 
an "ungapped" alignment. Typically, such ungapped alignments are performed only over a 
relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one insertion or 
deletion will cause the following amino acid residues to be put out of alignment, thus 
potentially resulting in a large reduction in % homology when a global alignment is 
performed. Consequently, most sequence comparison methods are designed to produce 
optimal alignments that take into consideration possible insertions and deletions without 
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penalising unduly the overall homology score. This is achieved by inserting "gaps" in the 
sequence ahgnment to try to maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs 
in the alignment so that, for the same number of identical amino acids, a sequence alignment 
with as few gaps as possible - reflecting higher relatedness between the two compared 
sequences - will achieve a higher score than one with many gaps. "Affine gap costs" are 
typically used that charge a relatively high cost for the existence of a gap and a smaller 
penalty for each subsequent residue in the gap. This is the most commonly used gap scoring 
system. High gap penalties will of course produce optimised ahgnments with fewer gaps. 
Most alignment programs allow the gap penalties to be modified. However, it is preferred to 
use the default values when using such software for sequence comparisons. For example, 
when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid 
sequences is -12 for a gap and -4 for each extension. 

Calculation of maximum % homology therefore firstly requires the production of an 
optimal alignment, taking into consideration gap penalties. A suitable computer program for 
carrying out such an alignment is the GCG Wisconsin Bestfit package (University of 
Wisconsin, U.S.A.; Devereux et al, 1984, Nucleic Acids Research 12:387). Examples of 
other software than can perform sequence comparisons include, but are not limited to, the 
BLAST package (Ausubel et aL, 1999 ibid- Chapter 18), FASTA (Atschul et al, 1990, J. 
Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and 
FASTA are available for offline and online searching (Ausubel et al,, 1999 ibid, pages 7- 
58 to 7-60). 

Although the final % homology can be measured in terms of identity, the 
alignment process itself is typically not based on an all-or-nothing pair comparison. 
Instead, a scaled similarity score matrix is generally used that assigns scores to each 
pairwise comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied. It is preferred to use the 
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public default values for the GCG package, or in the case of other software, the default 
matrix, such as BLOSUM62. 



Advantageously, the BLAST algorithm is employed, with parameters set to default 
values. The BLAST algorithm is described in detail at 
5 http://www.ncbi.nih.gov/BLAST/blast_help.html, which is incorporated herein by 

reference. The search parameters are defined as follows, can be advantageously set to the 
defined default parameters. 

Advantageously, "substantial identity" when assessed by BLAST equates to 
sequences which match with an EXPECT value of at least about 7, preferably at least 
10 about 9 and most preferably 10 or more. The default threshold for EXPECT in BLAST 
searching is usually 1 0. 

BLAST (Basic Local Aligimient Search Tool) is the heuristic search algorithm 
employed by the programs blastp, blastn, blastx, tblastn, and tblastx; these programs 
ascribe significance to their findings using the statistical methods of Karlin and Altschul 
15 (Karlin and Altschul 1990, Proc. Natl Acad. Set USA 87:2264-68; Karlin and Altschul, 
1993, Proc. Natl Acad. Set USA 90:5873-7; see 

http://www.ncbi.nih.gov/BLAST/blast_help.html) with a few enhancements. The BLAST 
programs are tailored for sequence similarity searching, for example to identify 
homologues to a query sequence. For a discussion of basic issues in similarity searching of 
20 sequence databases, see Altschul et al (1 994) Nature Genetics 6: 1 19-129. 

The five BLAST programs available at http://wvw.ncbi.iilm.nih.gov perform the 
following tasks: blastp - compares an amino acid query sequence against a protein 
sequence database; blastn - compares a nucleotide query sequence against a nucleotide 
sequence database; blastx - compares the six-fi:ame conceptual translation products of a 
25 nucleotide query sequence (both strands) against a protein sequence database; tblastn - 
compares a protein query sequence against a nucleotide sequence database dynamically 
translated in all six reading frames (both strands); tblastx - compares the six-frame 
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translations of a nucleotide query sequence against the six-frame translations of a 
nucleotide sequence database. 

BLAST uses the following search parameters: 

HISTOGRAM - Display a histogram of scores for each search; defauh is yes. (See 
5 parameter H in the BLAST Manual). 

' DESCRIPTIONS - Restricts the number of short descriptions of matching 
^ sequences reported to the number specified; default limit is 100 descriptions. (See 

parameter V in the manual page). 

EXPECT - The statistical significance threshold for reporting matches against 
10 database sequences; the defauh value is 10, such that 10 matches are expected to be found 
merely by chance, according to the stochastic model of Karlin and Altschul (1990). If the 
statistical significance ascribed to a match is greater than the EXPECT threshold, the 
match will not be reported. Lower EXPECT thresholds are more stringent, leading to 
fewer chance matches being reported. Fractional values are acceptable. (See parameter E 
1 5 in the BLAST Manual). 

• 

CUTOFF - Cutoff score for reporting high-scoring segment pairs. The default 
value is calculated from the EXPECT value (see above). HSPs are reported for a database 
sequence only if the statistical significance ascribed to them is at least as high as would be 
ascribed to a lone HSP having a score equal to the CUTOFF value. Higher CUTOFF 
20 values are more stringent, leading to fewer chance matches being reported. (See parameter 
S in the BLAST Manual). Typically, significance thresholds can be more mtuitively 
managed using EXPECT. 
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ALIGNMENTS - Restricts database sequences to the number specified for which 
high-scoring segment pairs (HSPs) are reported; the default limit is 50. If more database 
sequences than this happen to satisfy the statistical significance threshold for reporting 
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(see EXPECT and CUTOFF below), only the matches ascribed the greatest statistical 
significance are reported. (See parameter B in the BLAST Manual). 



MATRIX - Specify an alternate scoring matrix for BLASTP, BLASTX, 
TBLASTN and TBLASTX. The default matrix is BLOSUM62 (Henikdff & Henikoff, 
5 1992). The valid alternative choices include: PAM40, PAM120, PAM250 and 

IDENTITY. No alternate scoring matrices are available for BLASTN; specifying the 
MATRIX directive in BLASTN requests returns an error response. 

STRAND - Restrict a TBLASTN search to just the top or bottom strand of the 
database sequences; or restrict a BLASTN, BLASTX or TBLASTX search to just reading 
1 0 frames on the top or bottom strand of the query sequence. 

FILTER - Mask off segments of the query sequence that have lov/ compositional 
complexity, as determined by the SEG program of Wootton & Federhen (1993) 
Computers and Chemistry 17:149-163, or segments consisting of short-periodicity intemai 
repeats, as determined by the XNU program of Claverie & States (1993) Computers and 
15 Chemistry 17:191-201, or, for BLASTN, by the DUST program of Tatusov and Lipman 
(see http://wvAv,ncbi.nlm.nih.gov). Filtering can eliminate statistically significant but 
biologically uninteresting reports from the blast output (e.g., hits against common acidic-, 
basic- or proline-rich regions), leaving the more biologically interesting regions of the 
query sequence available for specific matching against database sequences. 

20 Low complexity sequence found by a filter program is substituted using the letter 

"N" in nucleotide sequence (e.g., "NNNNNNNNNNNNN") and the letter "X" in protein 
sequences (e.g., "XXXXXXXXX"). 

Filtering is only applied to the query sequence (or its translation products), not to 
database sequences. Default fihering is DUST for BLASTN, SEG for other programs. 

25 It is not unusual for nothing at all to be masked by SEG, XNU, or both, when 

applied to sequences in SWISS-PROT, so filtering should not be expected to always yield 
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an effect. Furthermore, in some cases, sequences are masked in their entirety, indicating 
that the statistical significance of any matches reported against the unfiltered query 
sequence should be suspect. 

NCBI-gi - Causes NCBI gi identifiers to be shown in the output, in addition to the 
5 accession and/or locus name. 

Most preferably, sequence comparisons are conducted using the simple BLAST 
search algorithm provided at http://www.ncbi.nlm.nih.gov/BLAST. In some embodiments 
^ of the present invention, no gap penalties are used when determining sequence identity. 

Hybridisation 

10 The present invention also encompasses nucleotide sequences that are capable of 

hybridising to the sequences presented herein, or any firagment or derivative thereof, or to 
the complement of any of the above. 

Hybridization means a ''process by which a strand of nucleic acid joins with a 
complementary strand through base pairing" (Coombs J (1994) Dictionary of 
9 15 Biotechnology, Stockton Press, New York NY) as well as the process of amplification as 
carried out in polymerase chain reaction technologies as described in Dieffenbach CW and 
GS Dveksler (1995, PGR Primer, a Laboratory Manual, Cold Spring Harbor Press, 
Plainview NY). 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic 
20 acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego CA), and 
confer a defined "stringency" as explained below. 
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Nucleotide sequences of the invention capable of selectively hybridising to the 
nucleotide sequences presented herein, or to their complement, will be generally at least 70%, 
preferably at least 75%, more preferably at least 85 or 90% and even more preferably at least 
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95% or 98% homologous to the corresponding nucleotide sequences presented herein over a 
region of at least 20, preferably at least 25 or. 30, for instance at least 40, 60 or 100 or more 
contiguous nucleotides. Preferred nucleotide sequences of the invention will comprise 
regions homologous to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or 
SEQ ID NO: 10, preferably at least 70%, 80% or 90% and more preferably at least 95% 
homologous to one of the sequences. 

The term "selectively hybridizable" means that the nucleotide sequence used as a 
probe is used under conditions where a target nucleotide sequence of the invention is found to 
hybridize to the probe at a level significantly above background. The background 
hybridization may occur because of other nucleotide sequences present, for example, in the 
cDNA or genomic DNA library being screened. In this event, background imphes a level of 
signal generated by interaction between the probe and a non-specific DNA member of the 
library which is less than 10 fold, preferably less than 100 fold as intense as the specific 
interaction observed v^th the target DNA, The intensity of interaction may be measured, for 
example, by radiolabelling the probe, e.g. with P. 

Also included within the scope of the present invention are nucleotide sequences 
that are capable of hybridizing to the nucleotide sequences presented herein under 
conditions of intermediate to maximal stringency. Hybridization conditions are based on 
the melting temperature (Tm) of the nucleic acid binding complex, as taught in Berger and 
Kinmiel (1987, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol 
152, Academic Press, San Diego CA), and confer a defined "stringency" as explained 
below. 

Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of the 
probe); high stringency at about 5°C to 10°C below Tm; intermediate stringency at about 
lO'^C to 20°C below Tm; and low stringency at about 20°C to 25°C below Tm. As will be 
understood by those of skill in the art, a maximum stringency hybridization can be used to 
identify or detect identical nucleotide sequences while an intermediate (or low) stringency 
hybridization can be used to identify or detect similar or related nucleotide sequences. 
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In a preferred embodiment, the present invention covers nucleotide sequences that 
can hybridise to one or more of the Mowgli GPCR nucleotide sequences of the present 
invention under stringent conditions (e.g. eS^'C and O.lxSSC {IxSSC = 0.15 M NaCl, 0.015 
M Na3 Citrate pH 7.0). Where the nucleotide sequence of the invention is double-stranded, 
both strands of the duplex, either individually or in combination, are encompassed by the 
present invention. Where the nucleotide sequence is single-stranded, it is to be understood 
that the complementary sequence of that nucleotide sequence is also included within the 
scope of the present invention. 

The present invention also encompasses nucleotide sequences that are capable of 
hybridising to the sequences that are complementary to the sequences presented herein, or 
any fragment or derivative thereof. Likewise, the present invention encompasses 
nucleotide sequences that are complementary to sequences that are capable of hybridising 
to the sequence of the present invention. These types of nucleotide sequences are 
examples of variant nucleotide sequences. In this respect, the term "variant" encompasses 
sequences that are complementary to sequences that are capable of hydridising to the 
nucleotide sequences presented herein. Preferably, however, the term "variant" 
encompasses sequences that are complementary to sequences that are capable of 
hydridising under stringent conditions (eg. 65°C and O.lxSSC {IxSSC = 0.15 MNaCl, 
0.015 Na3 citrate pH 7.0}) to the nucleotide sequences presented herein. 

Cloning of Mowgli GPCR and Homologues 

The present invention also encompasses nucleotide sequences that are 
complementary to the sequences presented here, or any fragment or derivative thereof. If 
the sequence is complementary to a fragment thereof then that sequence can be used as a 
probe to identify and clone similar GPCR sequences in other organisms etc. 

The present invention thus enables the cloning of Mowgli GPCR, its homologues 
and other structurally or .functionally related genes from human and other species such as 
mouse, pig, sheep, etc to be accomplished. Polynucleotides of the invention, which are 
identical or sufficiently identical to a nucleotide sequence contained in SEQ ID NO: 1, 
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SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10 or a fragment thereof 
may be used as hybridization probes for cDNA and genomic DNA, to isolate partial or 
full-length cDNAs and genomic clones encoding Mowgli GPCR from appropriate 
libraries. Such probes may also be used to isolate cDNA and genomic clones of other 
genes (including genes encoding homologues and orthologues from species other than 
human) that have sequence similarity, preferably high sequence similarity, to the Mowgli 
GPCR gene. Hybridization screening, cloning and sequencing techniques are known to 
those of skill in the art and are described in, for example, Sambrook et al {supra). 

Typically nucleotide sequences suitable for use as probes are 70% identical, 
preferably 80% identical, more preferably 90% identical, even more preferably 95% 
identical to that of the referent. The probes generally will comprise at least 15 nucleotides. 
Preferably, such probes will have at least 30 nucleotides and may have at least 50 
nucleotides. Particularly preferred probes will range between 150 and 500 nucleotides, 
more particularly about 300 nucleotides. 

In one embodiment, to obtain a polynucleotide encoding a Mowgli GPCR 
polypeptide, including homologues and orthologues from species other than human, 
comprises the steps of screening an appropriate library under stringent hybridization 
conditions with a labelled probe having the SEQ ID NO: 1,-SEQ ID NO: 2, SEQ ID NO: 
4, SEQ ID NO: 7 or SEQ ID NO: 10 or a fragment thereof and isolating partial or full- 
length cDNA and genomic clones containing said polynucleotide sequence. Such 
hybridization techniques are well known to those of skill in the art. Stringent hybridization 
conditions are as defined above or altematively conditions xmder overnight incubation at 
42 degrees C. in a solution comprising: 50% formamide, 5XSSC (150 mM NaCl, 15niM 
trisodium citrate), 50 mM sodium phosphate (pH7.6), 5XDenhardt's solution, 10% 
dextran sulphate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed 
by washing the filters in O.IXSSC at about 65 degrees C. 
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Functional Assay for Mowgli GPCR 

The cloned putative Mowgli GPCR polynucleotides may be verified by sequence 
analysis or functional assays. For example, the putative Mowgli GPCR or homologue may 
be assayed for receptor activity as follows. Capped RNA transcripts from linearized 
5 plasmid templates encoding the Mowgli receptor cDNAs of the invention are synthesized 
in vitro with RNA poljonerases in accordance with standard procedures. In vitro 
transcripts are suspended in water at a final concentration of 0.2 mg/ml. Ovarian lobes are 
removed from adult female toads, Stage V defolliculated oocytes are obtained, and RNA 
transcripts (10 ng/oocyte) are injected in a 50 nl bolus using a microinjection apparatus. 
10 Two electrode voltage clamps are used to measure the currents from individual Xenopus 
oocytes in response to agonist exposure. Recordings are made in Ca^"** free Barth's 
medium at room temperature. The Xenopus system may also be used to screen known 
ligands and tissue/cell extracts for activating ligands, as described in further detail below. 

Expression Assavs for Mowgli GPCR 

15 In order to design useful therapeutics for treating Mowgli GPCR associated 

diseases, it is useful to determine the expression profile of Mowgli (whether wild-type or a 
particular mutant). Thus, methods known in the art may be used to determine the organs, 
tissues aad cell types (as well as the developmental stages) in which Mowgli is expressed. 
For example, traditional or "electronic" Northems may be conducted. Reverse- 

20 transcriptase PGR (RT-PCR) may also be employed to assay expression of the MowgU 

gene or mutant. More sensitive methods for determining the expression profile of Mowgli 
include RNAse protection assays, as known in the art. 

Northern analysis is a laboratory technique used to detect the presence of a 
transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a 
25 membrane on which RNAs from a particular cell type or tissue have been bound. 
(Sambrook, supra, ch. 7 and Ausubel, F. M. et al. supra, ch. 4 and 16.) Analogous 
computer techniques ("electronic Northems") applying BLAST may be used to search for 
identical or related molecules in nucleotide databases such as GenBank or the LIFESEQ 
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database (Incyte Pharmaceuticals). This type of analysis has advantages in that they may 
be faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized 
as exact or homologous. 

The polynucleotides and polypeptides of the present invention, including the 
probes described above, may be employed as research reagents and materials for discovery 
of treatments and diagnostics to animal and human disease, as explained in further detail 
elsewhere in this document. 

Expression of Mowgli GPCR Polypeptides 

The invention includes a process for producing a Mowgli GPCR polypeptide. The 
method comprises in general culturing a host cell comprising a nucleic acid encoding 
Mowgli GPCR polypeptide, or a homologue, variant, or derivative thereof, under suitable 
conditions (i.e., conditions in which the Mowgli GPCR polypeptide is expressed). 

In order to express a biologically active Mowgli GPCR, the nucleotide sequences 
encoding Mowgli GPCR or homologues, variants, or derivatives thereof are inserted into 
appropriate expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. 

Methods which are well known to those skilled in the art are used to construct 
expression vectors containing sequences encoding Mowgli GPCR and appropriate 
transcriptional and translational control elements. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. 
Such techniques are described in Sambrook, J. et al. (1989; Molecular Cloning, A 
Laboratory Manual, ch. 4, 8, and 16-17, Cold Spring Harbor Press, Plainview, N.Y.) and 
Ausubel, F. M. et al. (1995 and periodic supplements; Current Protocols in Molecular 
Biology, ch. 9, 13, and 16, John Wiley & Sons, New York, N.Y.). 



A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding Mowgli GPCR. These include, but are not limited to, microorganisms 
such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA 
expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
5 infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed 
with virus expression vectors (e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic 
virus (TMV)) or with bacterial expression vectors (e.g,, Ti or pBR322 plasmids); or 
animal cell systems. The invention is not limited by the host cell employed. 

The "control elements" or "regulatory sequences" are those non-translated regions 
10 of the vector (i.e., enhancers, promoters, and 5' and 3' untranslated regions) which interact 
with host cellular proteins to carry out transcription and translation. Such elements may 
vary in their strength and specificity. Depending on the vector system and host utilized, 
any number of suitable transcription and translation elements, including constitutive and 
. inducible promoters, may be used. For example, when cloning in bacterial systems, 
15 inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid 
(Stratagene, La Jolla, Calif) or PSPORTl plasmid (GIBCO/BRL), and the like, may be 
used. The baculovirus polyhedrin promoter may be used in insect cells. Promoters or 
enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO, and 
storage protein genes) or from plant viruses (e.g., viral promoters or leader sequences) 
20;. may be cloned into the vector. In mammalian cell systems, promoters from mammalian 
genes or from mammalian viruses are preferable. If it is necessary to generate a cell line 
that contains multiple copies of the sequence encoding Mowgli GPCR, vectors based on 
SV40 or EBV may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending 
25 upon the use intended for Mowgli GPCR. For example, when large quantities of Mowgli 
GPCR are needed for the induction of antibodies, vectors which direct high level 
expression of fusion proteins that are readily purified may be used. Such vectors include, 
but are not limited to, multifunctional E. coli cloning and expression vectors such as 
BLUESCRIPT (Stratagene), in which the sequence encoding Mowgli GPCR may be 
30 ligated into the vector in frame with sequences for the amino-terminal Met and the 
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subsequent 7 residues of p-galactosidase so that a hybrid protein is produced, pIN vectors 
(Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509), and the like. 
pGEX vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides 
as fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins 
are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or. factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast Saccharomyces cerevisiae, a number of vectors containing constitutive 
or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, may be used. For 
reviews, see Ausubel (supra) and Grant et al. (1987; Methods Enzymol. 153:516-544). 

In cases where plant expression vectors are used, the expression of sequences 
encoding Mowgli GPCR may be driven by any of a number of promoters. For example, 
viral promoters such as the 3 5S and 198 promoters of CaMV may be used alone or in 
combination with the omega leader sequence from TMV. (Takamatsu, N. (1987) EMBO J. 
6:307-31 1 .) Alternatively, plant promoters such as the small subunit of RUBISCO or heat 
shock promoters may be used. (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, 
R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. Such techniques are described in a 
number of generally available reviews. (See, for example, Hobbs, S. or Murry, L. E. in 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, 
N.Y.;pp. 191-196.). 

An insect system may also be used to express Mowgli GPCR. For example, in one 
such system, Autographa calif ornica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. 
The sequences encoding Mowgli GPCR may be cloned into a non-essential region of the 
virus, such as the polyhedrin gene, and placed under control of the polyhedrin promoter. 
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Successflil insertion of Mowgli GPCR will render the polyhedrin gene inactive and 
produce recombinant virus lacking coat protein. The recombinant viruses may then be 
used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which Mowgli 
GPCR may be expressed. (Engelhard, E. K. et al. (1994) Proc. Nat. Acad. Sci. 91 :3224- 
5 3227.) 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, sequences encoding 
Mowgli GPCR may be ligated into an adenovirus transcription/translation complex 
consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential 
El or E3 region of the viral genome may be used to obtain a viable vims which is capable 
of expressing Mowgli GPCR in infected host cells. (Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. 81 :3655-3659.) In addition, transcription enhancers, such as the Rous 
sarcoma virus (RS V) enhancer, may be used to increase expression in mammalian host 
cells. 

Thus, for example, the Mowgli receptors of the present invention are expressed in 
either human embryonic kidney 293 (HEK293) cells or adherent dhfr CHO cells. To 
maximize receptor expression, typically all 5' and 3' untranslated regions (UTRs) are 
removed from the receptor cDNA prior to insertion into a pCDN or pCDNA3 vector. The 
cells are transfected with individual receptor cDNAs by lipofectin and selected in the 
presence of 400 mg/ml G418. After 3 weeks of selection, individual clones are picked and 
expanded for further analysis. HEK293 or CHO cells transfected with the vector alone 
serve as negative controls. To isolate cell lines stably expressing the individual receptors, 
about 24 clones are typically selected and analyzed by Northern blot analysis. Receptor 
mRNAs are generally detectable in about 50% of the G418-resistant clones analyzed. 

25 Human artificial chromosomes (HACs) may also be employed to deliver larger 

fragments of DNA than can be contained and expressed in a plasmid. HACs of about 6 kb 
to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, 
polycationic amino polymers, or vesicles) for therapeutic purposes. 
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Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding Mowgii GPCR. Such signals include the ATG initiation codon and 
adjacent sequences. In cases where sequences encoding Mowgii GPCR and its initiation 
codon and upstream sequences are inserted into the appropriate expression vector, no 
5 additional transcriptional or translational control signals may be needed. However, in 
cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including the ATG initiation codon should be provided. 
Furthermore, the initiation codon should be in the correct reading frame to ensure 
translation of the entire insert. Exogenous translational elements and initiation codons may 
^ 10 be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular cell system used, 
such as those described in the literature. (Scharf, D. et ai. (1994) Results Probl. Cell 
Differ. 20:125-162.) 

In addition, a host cell strain may be chosen for its ability to modulate expression 
15 of the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing 
which cleaves a "prepro" form of the protein may also be used to facilitate correct 
9 insertion, folding, and/or function. Different host cells which have specific cellular 

20 machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 
Collection (ATCC, Bethesda, Md.) and may be chosen to ensure the correct modification 
and processing of the foreign protein. 

For long term, high yield production of recombinant proteins, stable expression is 
25 preferred. For example, cell lines capable of stably expressing Mowgii GPCR can be 

transformed using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a 
separate vector. Following the introduction of the vector, cells may be allowed to grow for 
about 1 to 2 days in enriched media before being switched to selective media. The purpose 
30 of the selectable marker is to confer resistance to selection, and its presence allows growth 
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and recovery of cells which successfully express the introduced sequences. Resistant 
clones of stably transformed cells may be proliferated using tissue culture techniques 
appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase genes 
(Wigler, M. et al. (1977) Cell 1 1:223-32) and adenine phosphoribosyltransferase genes 
(Lowy, L et al. (1980) Cell 22:817-23), which can be employed in tk' or apr' cells, 
respectively. Also, antimetaboUte, antibiotic, or herbicide resistance can be used as the 
basis for selection. For example, dhfr confers resistance to methotrexate (Wigler, M. et al. 
(1980) Proc. Natl. Acad. Sci. 77:3567-70); npt confers resistance to the aminoglycosides 
neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); and als or 
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively 
(Murry, supra). Additional selectable genes have been described, for example, trpB, which 
allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize 
histinol in place of histidine. (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. 
Sci. 85:8047-51.) Recently, the use of visible markers has gained popularity with such 
markers as anthocyanins, p -glucuronidase and its substrate GUS, and luciferase and its 
substrate luciferin. These markers can be used not only to identify transformants, but also 
to quantify the amount of transient or stable protein expression attributable to a specific 
vector system. (Rhodes, C. A. et al. (1995) Methods Mpl. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. 
For example, if the sequence encoding Mowgli GPCR is inserted within a marker gene 
sequence, transformed cells containing sequences encoding Mowgli GPCR can be 
identified by the absence of marker gene function. Altematively, a marker gene can be 
placed in tandem with a sequence encoding Mowgli GPCR under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually 
indicates expression of the tandem gene as well. 
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Alternatively, host cells which contain the nucleic acid sequence encoding Mowgli 
GPCR and express Mowgli GPCR may be identified by a variety of procedures known to 
those of skill in the art. These procedures include, but are not limited to, DNA-DNA or 
DNA-RNA hybridizations and protein bioassay or immunoassay techniques which include 
membrane, solution, or chip based technologies for the detection and/or quantification of 
nucleic acid or protein sequences. 

The presence of polynucleotide sequences encoding Mowgli GPCR can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using probes or 
fragments or fragments of polynucleotides encoding Mowgli GPCR. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers based on the 
sequences encoding Mowgli GPCR to detect transformants containing DNA or RNA 
encoding Mowgli GPCR. 

A variety of protocols for detecting and measuring the expression of Mowgli 
GPCR, using either polyclonal or monoclonal antibodies specific for the protein, are 
known in the art. Examples of such techniques include enzyme-linked immunosorbent 
assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting 
(FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on Mowgli GPCR is preferred, but a competitive 
binding assay may be employed. These and other assays are well described in the art, for 
example, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, Section 
IV, APS Press, St Paul, Minn.) and in Maddox, D. E. et al. (1983; J. Exp. Med. 158:121 1- 
1216). 

A wide variety of labels and conjugation techniques are known by those skilled in 
the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides encoding Mowgli GPCR include oligolabeling, nick translation, end- 
labeling, or PCR amplification using a labeled nucleotide. Alternatively, the sequences 
encoding Mowgli GPCR, or any fragments thereof, may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially 
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available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These 
procedures may be conducted using a variety of commercially available kits, such as those 
provided by Pharmacia & Upjohn (Kalamazoo, Mich.), Promega (Madison, Wis.), and 
5 U.S. Biochemical Corp. (Cleveland, Ohio). Suitable reporter molecules or labels which 
may be used for ease of detection include radionuclides, enzymes, fluorescent, 
' chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

1^ Host cells transformed with nucleotide sequences encoding Mowgli GPCR may be 

1 0 cultured under conditions suitable for the expression and recovery of the protein from cell 
culture. The protein produced by a transformed cell may be located in the cell membrane, 
secreted or contained intracellularly depending on the sequence and/or the vector used. As 
will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode Mowgli GPCR may be designed to contain signal 

1 5 sequences which direct secretion of Mowgli GPCR through a prokaryotic or eukaryotic 
cell membrane. Other constructions may be used to join sequences encoding Mowgli 
GPCR to nucleotide sequences encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are not 
^ limited to, metal chelating peptides such as histidine-tryptophan modules that allow 

20 purification on immobilized metals, protein A domains that allow purification on 

immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences, such as those specific for Factor XA or enterokinase (Invitrogen, San Diego, 
Calif), between the purification domain and the Mowgli GPCR encoding sequence may 

25 be used to facilitate purification. One such expression vector provides for expression of a 
fusion protein containing Mowgli GPCR and a nucleic acid encoding 6 histidine residues 
preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate 
purification on immobilized metal ion affinity chromatography (IMIAC; described in 
Porath, J. et al. (1992) Prot. Exp. Purif 3: 263-281), while the enterokinase cleavage site 

30 provides a means for purifying Mowgli GPCR from the fiision protein. A discussion of 
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vectors which contain fusion proteins is provided in KroU, D. J. et al. (1993; DNA Cell 
Biol. 12:441-453). 

Fragments of Mowgli GPCR may be produced not only by recombinant 
production, but also by direct peptide synthesis using solid-phase techniques. (Merrifield 
5 J. (1963) J. Am. Chem, Soc. 85:2149-2154.) Protein synthesis may be performed by 

manual techniques or by automation. Automated synthesis may be achieved, for example, 
using the Applied Biosystems 431 A peptide synthesizer (Perkin Elmer). Various 
fragments of Mowgli GPCR may be synthesized separately and then combined to produce 
^ the full length molecule. 

10 Biosensors 

The Mowgli polypeptides, nucleic acids, probes, antibodies, expression vectors and 
ligands are useful as (and for the production of) biosensors. 

According to Aizawa (1988), Anal Chem. Symp. 17: 683, a biosensor is defined as 
being a unique combination of a receptor for molecular recognition, for example a 
15 selective layer with immobilized antibodies or receptors such as a Mowgli G-protein 
^ coupled receptor, and a transducer for transmitting the values measured. One group of 

such biosensors will detect the change which is caused in the optical properties of a 
surface layer due to the interaction of the receptor with the surrounding medium. Among 
such techniques may be mentioned especially ellipso-metry and surface piasmon 
20 resonance. Biosensors incorporating Mowgli may be used to detect the presence or level 
of Mowgli ligands, for example, nucleotides such as purines or purine analogues, or 
analogues of these ligands. The constmction of such biosensors is well known in the art. 

Thus, cell lines expressing Mowgli receptor may be used as reporter systems for 
detection of ligands such as ATP via receptor-promoted formation of [3H]inositol 
25 phosphates or other second messengers (Watt et al., 1998, J Biol Chem May 

29;273(22): 14053-8). Receptor-ligand biosensors are also described in Hoffman et aL, 
2000, Proc Natl Acad Sci US A Oct 10;97(21):1 1215-20. Optical and other biosensors 
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comprising Mowgli may also be used to detect the level or presence of interaction with G- 
proteins and other proteins, as described by, for example, Figler et al, 1997, Biochemistry 
Dec 23;36(51):16288-99 and Sarrio et al., 2000, Mol Cell Biol 2000 Jul;20(14):5 164-74). 
Sensor units for biosensors are described in, for example, US 5,492,840. 

5 Screening Assays 

The Mowgli . GPCR polypeptide of the present invention, including homologues, 
variants, and derivatives, whether natural or recombinant, may be employed in a screening 
process for compounds which bind the receptor and which activate (agonists) or inhibit 
activation of (antagonists) of Mowgli. Thus, polypeptides of the invention may also be 
1 0 used to assess the binding of small molecule substrates and ligands in, for example, cells, 
cell-free preparations, chemical libraries, and natural product mixtures. These substrates 
and ligands may be natural substrates and ligands or may be structural or functional 
mimetics. See Coligan et al.. Current Protocols in Immunology l(2):Chapter 5 (1991). 

Mowgli GPCR polypeptides are responsible for many biological functions, 
15 including many pathologies. Accordingly, it is desirous to find compounds and drugs 
which stimulate Mowgli GPCR on the one hand and which can inhibit the function of 
Mowgli GPCR on the other hand. In general, agonists and antagonists are employed for 
therapeutic and prophylactic purposes for such conditions as infections such as bacterial, 
fungal, protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; 
20 pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; Parkinson's disease; 

thrombosis; acute heart failure; hypotension; hypertension; erectile dysfunction; urinary 
retention; metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 
disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
25 neurological disorders, including anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles dela Tourett's syndrome. 
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In particular, agonists and antagonists of Mowgli GPCR may be used to treat or 
prevent platelet ADP receptor defect, autosomal recessive deafness, myotonic dystrophy, 
essential hypertension. Usher syndrome, in particular type 3 Usher syndrome, juvenile- 
onset cataract, congenital cataract, and 3-methylcrotonylglycinuria I. 

5 Rational design of candidate compounds likely to be able to interact with Mowgli 

GPCR protein may be based upon structural studies of the molecular shapes of a 
polypeptide according to the invention. One means for determining which sites interact 
with specific other proteins is a physical structure determination, e.g.. X-ray 
^ crystallography or two-dimensional NMR techniques. These will provide guidance as to 

10 which amino acid residues form molecular contact regions. For a detailed description of 
protein structural determination, see, e.g., Blundell and Johnson (1976) Protein 
Crystallography, Academic Press, New York. 

An alternative to rational design uses a screening procedure which involves in 
general producing appropriate cells which express the Mowgli receptor polypeptide of the 

15 present invention on the surface thereof. Such cells include cells from animals, yeast, 
Drosophila or coli. Cells expressing the receptor (or cell membrane containing the 
expressed receptor) are then contacted with a test compound to observe binding, or 
stimulation or inhibition of a functional response. For example, Xenopus oocytes may be 
injected with Mowgli mRNA or polypeptide, and currents induced by exposure to test 

20 compounds measured by use of voltage clamps measured, as described in further detail 
elsewhere. 



Furthermore, microphysiometric assays may be employed to assay Mowgli 
receptor activity. Activation of a wide variety of secondary messenger systems results in 
extrusion of small amounts of acid from a cell. The acid formed is largely as a result of the 
25 increased metabolic activity required to fiiel the intracellular signalling process. The pH 
changes in the media surrounding the cell are very small but are detectable by, for 
example, the CYTOSENSOR microphysiometer (Molecular Devices Ltd., Menlo Park, 
Calif.). The CYTOSENSOR is thus capable of detecting the activation of a receptor which 
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is coupled to an energy utilizing intracellular signaling pathway such as the G-protein 
coupled receptor of the present invention. 

Instead of testing each candidate compound individually with the Mowgli receptor, 
a library or bank of candidate ligands may advantageously be produced and screened. 
Thus, for example, a bank of over 200 putative receptor ligands has been assembled for 
screening. The bank comprises: transmitters, hormones and chemokines known to act via a 
human seven transmembrane (7TM) receptor; naturally occurring compounds which may 
be putative agonists for a human 7TM receptor, non-mammalian, biologically active 
peptides for which a mammalian counterpart has not yet been identified; and compounds 
not found in nature, but which activate 7TM receptors with unknown natural ligands. This 
bank is used to screen the receptor for known ligands, using both functional (i.e. calcium, 
cAMP, microphysiometer, oocyte electrophysiology, etc, see elsewhere) as well as 
binding assays as described in further detail elsewhere. However, a large number of 
mammalian receptors exist for which there remains, as yet, no cognate activating ligand 
(agonist) or deactivating ligand (antagonist). Thus, active ligands for these receptors may 
not be included within the ligands banks as identified to date. Accordingly, theMowgli 
receptor of the invention is also functionally screened (using calcium, cAMP, 
microphysiometer, oocyte electrophysiology, etc., functional screens) against tissue 
extracts to identify natural ligands. Extracts that produce positive functional responses can 
be sequentially subfractionated, with the fractions being assayed as described here, until 
an activating ligand is isolated and identified. 

7TM receptors which are expressed in HEK 293 cells have been shown to be 
coupled functionally to activation of PLC and calcium mobilization and/or cAMP 
stimuation or inhibition. One screening technique therefore includes the use of cells which 
express the Mowgli GPCR receptor of this invention (for example, transfected Xenopus 
oocytes, CHO or HEK293 ceils) in a system which measures extracellular pH or 
intracellular calcium changes caused by receptor activation. In this technique, compoimds 
may be contacted v^th cells expressing the receptor polypeptide of the present invention. 
A second messenger response, e.g., signal transduction, pH changes, or changes in 
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calcium level, is then measured to determine whether the potential compound activates or 
inhibits the receptor. 



In such experiments, basal calcium levels in the HEK 293 cells in receptor- 
transfected or vector control cells are observed to be in the normal, 100 nM to 200 nM, 
5 range, HEK 293 cells expressing Mowgli GPCR or recombinant Mowgli GPCR are loaded 
with fura 2 and in a single day more than 150 selected ligands or tissue/cell extracts are 
evaluated for agonist induced calciurii mobilization. Similarly, HEK 293 cells expressing 
Mowgli GPCR or recombinant Mowgli GPCR are evaluated for the stimulation or 
inhibition of cAMP production using standard cAMP quantitation assays. Agonists 
10 presenting a calcium transient or cAiVIP fluctuation are tested in vector control cells to 
determine if the response is unique to the transfected cells expressing receptor. 

Another method involves screening for receptor inhibitors by determining 
inhibition or stimulation of Mowgli receptor-mediated cAMP and/or adenylate cyclase 
accimiulation. Such a method involves transfecting a eukaryotic cell with the receptor of 
15 this invention to express the receptor on the cell surface. The cell is then exposed to 
potential antagonists in the presence of the receptor of this invention. The amount of 
cAMP accumulation is then measured. If the potential antagonist binds the receptor, and 
thus inhibits receptor binding, the levels of receptor-mediated cAMP, or adenylate cyclase, 
activity will be reduced or increased. 

20 Another method for detecting agonists or antagonists for the receptor of the present 

invention is the yeast based technology as described in U.S. Pat. No. 5,482,835, 
incorporated by reference herein. 

Where the candidate compounds are proteins, in particular antibodies or peptides, 
libraries of candidate compounds may be screened using phage display techniques. Phage 
25 display is a protocol of molecular screening which utilises recombinant bacteriophage. 

The technology involves transforming bacteriophage with a gene that encodes one 
compound from the library of candidate compounds, such that each phage or phagemid 
expresses a particular candidate compound. The transformed bacteriophage (which 



) 
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preferably is tethered to a solid support) expresses the appropriate candidate compound 
and displays it on their phage coat. Specific candidate compounds which are capable of 
binding to a polypeptide or peptide of the invention are enriched by selection strategies 
based on affinity interaction. The successful candidate agents are then characterised. 
5 Phage display has advantages over standard affinity ligand screening technologies. The 
phage surface displays the candidate agent in a three dimensional configuration, more 
closely resembling its naturally occurring conformation. This allows for more specific and 
higher affinity binding for screening purposes. 

Another method of screening a library of compounds utilises eukaryotic or 
10 prokaryotic host cells which are stably transformed with recombinant DNA molecules 

expressing a library of compounds. Such cells, either in viable or fixed form, can be used 
for standard binding-partner assays. See also Parce et al (1989) Science 246:243-247; and 
Owicki et al. (1990) Proc. Nat'l Acad. Sci. USA 87;4007-401 1, which describe sensitive 
methods to detect cellular responses. Competitive assays are particularly useful, where the 
1 5 . cells expressing the library of compounds are contacted or incubated with a labelled 
antibody known to bind to a Mowgli polypeptide of the present invention, such as ^^^I- 
antibody, and a test sample such as a candidate compoimd whose binding affinity to the 
binding composition is being measured. The bound and firee labelled binding partners for 
the polypeptide are then separated to assess the degree of binding. The amount of test 
20 sample bound is inversely proportional to the amount of labelled antibody binding to the 
polypeptide. 

Any one of numerous techniques can be used to separate bound from free binding 
partners to assess the degree of binding. This separation step could typically involve a 
procedure such as adhesion to filters followed by washing, adhesion to plastic following 
25 by washing, or centrifugation of the cell membranes. 

Still another approach is to use solubilized, unpurified or solubilized purified 
polypeptide or peptides, for example extracted from transformed eukaryotic or prokaryotic 
host cells. This allows for a "molecular" binding assay with the advantages of increased 
specificity, the ability to automate, and high drug test throughput. 
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Another technique for candidate compound screening involves an approach which 
provides high throughput screening for new compounds having suitable binding affinity, 
e.g., to a polypeptide of the invention, and is described in detail in International Patent 
application no. WO 84/03564 (Commonwealth Serum Labs.), pubUshed on September 13 
5 1 984. First, large numbers of different small peptide test compounds are synthesized on a 
solid substrate, e.g., plastic pins or some other appropriate svirface; see Fodor et al (1991). 
Then all the pins are reacted with solubilized polypeptide of the invention and washed. 
The next step involves detecting bound polypeptide. Compounds which interact 
specifically with the polypeptide will thus be identified. 

10 Ligand binding assays provide a direct method for ascertammg receptor 

pharmacology and are adaptable to a high throughput format. The purified ligand for a 
receptor may be radiolabeled to high specific activity (50-2000 Ci/mmol) for binding 
studies. A determination is then made that the process of radiolabeling does not diminish 
the activity of the ligand towards its receptor. Assay conditions for buffers, ions, pH and 

1 5 other modulators such as nucleotides are optimized to establish a workable signal to noise 
ratio for both membrane and whole cell receptor sources. For these assays, specific 
receptor binding is defined as total associated radioactivity minus the radioactivity 
measured in the presence of an excess of unlabeled competing ligand. Where possible, 
more than one competing ligand is used to define residual nonspecific binding. 

20 The assays may simply test binding of a candidate compound wherein adherence to 

the cells bearing the receptor is detected by means of a label directly or indirectly 
associated with the candidate compound or in an assay involving competition with a 
labeled competitor. Further, these assays may test whether the candidate compound results 
in a signal generated by activation of the receptor, using detection systems appropriate to 

25 the cells bearing the receptor at their surfaces. Inhibitors of activation are generally 

assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. 



Further, the assays may simply comprise the steps of mixing a candidate 
compound with a solution containing a Mowgli GPCR polypeptide to form a mixture, 
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measuring Mowgli GPCR activity in the mixture, and comparing the Mowgii GPCR 
activity of the mixture to a standard. 

The Mowgli GPCR cDNA, protein and antibodies to the protein may also be used 
to configure assays for detecting the effect of added compounds on the production of 
5 Mowgli GPCR mRNA and protein in cells. For example, an ELISA may be constructed 
for measuring secreted or cell associated levels of Mowgli GPCR protein using 
monoclonal and polyclonal antibodies by standard methods known in the art, and this can 
be used to discover agents which may inhibit or enhance the production of Mowgli GPCR 
(also called antagonist or agonist, respectively) from suitably manipulated cells or tissues. 
10 Standard methods for conducting screening assays are well understood in the art. 

Examples of potential Mowgli GPCR antagonists include antibodies or, in some 
cases, nucleotides and their analogues, including purines and purine analogues, 
oligonucleotides or proteins which are closely related to the ligand of the Mowgli GPCR, 
e.g., a fi-agment of the ligand, or small molecules which bind to the receptor but do not 
15 elicit a response, so that the activity of the receptor is prevented. 

The present invention therefore also provides a compound capable of binding 
specifically to a Mowgli polypeptide and/or peptide of the present invention. 

The term "compound" refers to a chemical compound (naturally occurring or 
synthesised), such as a biological macromolecule (e.g., nucleic acid, protein, non-peptide, 
20 or organic molecule), or an extract made from biological materials such as bacteria, plants, 
fungi, or animal (particularly mammalian) cells or tissues, or even an inorganic element or 
molecule. Preferably the compound is an antibody. 



The materials necessary for such screening to be conducted may be packaged into 
a screening kit. Such a screening kit is useful for identifying agonists, antagonists, ligands, 
25 receptors, substrates, enzymes, etc. for MowgU GPCR polypeptides or conipounds which 
decrease or enhance the production of Mowgli GPCR polypeptides. The screening kit 
comprises: (a) a Mowgli GPCR polypeptide; (b) a recombinant cell expressing a Mowgli 
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GPCR polypeptide; (c) a cell membrane expressing a Mowgli GPCR polypeptide; or (d) 
antibody to a Mowgli GPCR polypeptide. The screening kit may optionally comprise 
instructions for use. 

Transgenic Animals 

The present invention further encompasses transgenic animals capable of 
expressing natural or recombinant Mowgli GPCR, or a homologue, variant or derivative, 
at elevated or reduced levels compared to the normal expression level. Included are 
transgenic animals ("Mowgli knockout"s) which do not express functional Mowgli 
receptor. The Mowgli knockouts may arise as a result offunctional disruption of the 
Mowgli gene or any portion of that gene, including one or more loss of function 
mutations, including a deletion or replacement, of the Mowgli gene.The mutations include 
single point mutations, and may target coding or non-codingregions of Mowgli. 

Preferably, such a transgenic animal is a non-human mammal, such as a pig, a 
sheep or a rodent. Most preferably the transgenic animal is a mouse or a rat. Such 
transgenic animals may be used in screening procedures to identify agonists and/or 
antagonists of Mowgli GPCR, as well as to test for their efficacy as treatments for diseases 
in vivo. 

For example, transgenic animals that have been engineered to be deficient in the 
production of Mowgli GPCR may be used in assays to identify agonists and/or antagonists 
of Mowgli GPCR. One assay is designed to evaluate a potential drug (aa candidate ligand 
or compound) to determine if it produces a physiological response in the absence of 
Mowgli GPCR receptors. This may be accomplished by administering the drug to a 
transgenic animal as discussed above, and then assaying the animal for a particular 
response. Although any physiological parameter could be measured in this assay, preferred 
responses include one or more of the following: changes to disease resistance; altered 
inflammatory responses; altered tumour susceptability: a change in blood pressure; 
neovascularization; a change in eating behavior; a change in body weight; a change in 
bone density; a change in body temperature; insulin secretion; gonadotropin secretion; 
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nasal and bronchial secretion; vasoconstriction; loss of memory; anxiety; hyporeflexia or 
hyperreflexia; pain or stress responses. 

Tissues derived from the Mowgli knockout animals may be used in receptor 
binding assays to determine whether the potential drug (a candidate ligand or compound) 
binds to the Mowgli receptor. Such assays can be conducted by obtaining a first receptor 
preparation from the transgenic animal engineered to be deficient in Mowgli receptor 
production and a second receptor preparation from a source known to bind any identified 
Mowgli ligands or compounds. In general, the first and second receptor preparations will 
be similar in all respects except for the source from which they are obtained. For example, 
if brain tissue from a transgenic animal (such as described above and below) is used in an 
assay, comparable brain tissue from a normal (wild type) animal is used as the source of 
the second receptor preparation. Each of the receptor preparations is incubated with a 
ligand known to bind to Mowgli receptors, both alone and in the presence of the candidate 
ligand or compound. Preferably, the candidate ligand or compound will be examined at 
several different concentrations. 

The extent to which binding by the known ligand is displaced by the test 
compound is determined for both the first and second receptor preparations. Tissues 
derived from transgenic animals may be used in assays directly or the tissues may be 
processed to isolate membranes or membrane proteins, which are themselves used in the 
assays. A preferred transgenic animal is the mouse. The ligand may be labeled using any 
means compatible with binding assays. This would include, without limitation, 
radioactive, enzymatic, fluorescent or chemiluminescent labeling (as well as other 
labelling techniques as described in fiuther detail above). 

Furthermore, antagonists of Mowgli GPCR receptor may be identified by 
administering candidate compounds, etc, to wild type animals expressing ftmctional 
Mowgli, and animals identified which exhibit any of the phenotypic characteristics 
associated with reduced or abolished expression of Mowgli receptor fimction. 
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Detailed methods for generating non-human transgenic animal are described in 
further detail below. Transgenic gene constructs can be introduced into the germ line of an 
animal to make a transgenic mammal. For example, one or several copies of the construct 
may be incorporated into the genome of a mammalian embryo by standard transgenic 
techniques. 

In an exemplary embodiment, the transgenic non-human animals of the invention 
are produced by introducing transgenes into the germline of the non-human animal. 
Embryonal target cells at various developmental stages can be used to introduce 
transgenes. Different methods are used depending on the stage of development of the 
embryonal target cell. The specific line(s) of any animal used to practice this invention are 
selected for general good health, good embryo yields, good pronuclear visibility in the 
embryo, and good reproductive fitness. In addition, the haplotype is a significant factor. 

Introduction of the transgene into the embryo can be accomplished by any means 
known in the art such as, for example, microinjection, electroporation, or lipofection. For 
example, the Mowgli receptor transgene can be introduced into a mammal by 
microinjection of the construct into the pronuclei of the fertilized mammalian egg(s) to 
cause one or more copies of the construct to be retained in the cells of the developing 
mamLmal(s). Following introduction of the transgene construct into the fertilized egg, the 
egg may be incubated in vitro for varying amounts of time, or reimplanted into the 
surrogate host, or both. In vitro incubation to maturity is within the scope of this invention. 
One common method in to incubate the embryos in vitro for about 1-7 days, depending on 
the species, and then reimplant them into the surrogate host. 

The progeny of the transgenically manipulated embryos can be tested for the 
presence of the construct by Southem blot analysis of the segment of tissue. If one or more 
copies of the exogenous cloned construct remains stably integrated into the genome of 
such transgenic embryos, it is possible to establish permanent transgenic mammal lines 
carrying the transgenically added construct. 
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The litters of transgenically altered mammals can be assayed after birth for the 
incorporation of the construct into the genome of the offspring. Preferably, this assay is 
accomplished by hybridizing a probe corresponding to the DNA sequence coding for the 
desired recombinant protein product or a segment thereof onto chromosomal material from 
the progeny. Those mammalian progeny found to contain at least one copy of the construct 
in their genome are grown to maturity. 

For the purposes of this invention a zygote is essentially the formation of a diploid 
cell which is capable of developing into a complete organism. Generally, the zygote will 
be comprised of an egg containing a nucleus formed, either naturally or artificially, by the 
fusion of two haploid nuclei from a gamete or gametes. Thus, the gamete nuclei must be 
ones which are naturally compatible, i.e., ones which result in a viable zygote capable of 
undergoing differentiation and developing into a functioning organism. Generally, a 
euploid zygote is preferred. If an aneuploid zygote is obtained, then the number of 
chromosomes should not vary by more than one with respect to the euploid number of the 
organism from which either gamete originated. 

In addition to similar biological considerations, physical ones also govern the 
amount (e.g., volume) of exogenous genetic material which can be added to the nucleus of 
the zygote or to the genetic material which forms a part of the zygote nucleus. If no 
genetic material is removed, then the amount of exogenous genetic material which can be 
added is limited by the amount which will be absorbed without being physically 
disruptive. Generally, the volume of exogenous genetic material inserted will not exceed 
about 10 picoliters. The physical effects of addition must not be so great as to physically 
destroy the viability of the zygote. The biological limit of the number and variety of DNA 
sequences will vary depending upon the particular zygote and functions of the exogenous 
genetic material and will be readily apparent to one skilled in the art, because the genetic 
material, including the exogenous genetic material, of the resulting zygote must be 
biologically capable of initiating and maintaining the differentiation and development of 
the zygote into a functional organism. 
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The number of copies of the transgene constructs which are added to the zygote is 
dependent upon the total amount of exogenous genetic material added and will be the 
amount which enables the genetic transformation to occur. Theoretically only one copy is 
required; however, generally, numerous copies are utilized, for example, 1,000-20,000 
copies of the transgene construct, in order to insure that one copy is functional. As regards 
the present invention, there will often be an advantage to having more than one 
functioning copy of each of the inserted exogenous DNA sequences to enhance the 
phenotypic expression of the exogenous DNA sequences. 

Any technique which allows for the addition of the exogenous genetic material into 
nucleic genetic material can be utilized so long as it is not destructive to the cell, nuclear 
membrane or other existing cellular or genetic structures. The exogenous genetic material 
is preferentially inserted into the nucleic genetic material by microinjection. 
Microinjection of cells and cellular structures is known and is used in the art. 

Reimplantation is accomplished using standard methods. Usually, the surrogate 
host is anesthetized, and the embryos are inserted into the oviduct. The number of 
embryos implanted into a particular host will vary by species, but will usually be 
comparable to the number of off spring the species naturally produces. 

Transgenic offspring of the surrogate host may be screened for the presence and/or 
expression of the transgene by any suitable method. Screening is often accomplished by 
Southern blot or Northern blot analysis, using a probe that is complementary to at least a 
portion of the transgene. Westem blot analysis using an antibody against the protein 
encoded by the transgene may be employed as an altemative or additional method for 
screening for the presence of the transgene product. Typically, DNA is prepared from tail 
tissue and analyzed by Southern analysis or PGR for the transgene. Altematively, the 
tissues or cells believed to express the transgene at the highest levels are tested fgr the 
presence and expression of the transgene using Southern analysis or PGR, although any 
tissues or cell types may be used for this analysis. 



Alternative or additional methods for evaluating the presence of the transgene 
include, without limitation, suitable biochemical assays such as enzyme and/or 
immunological assays, histological stains for particular marker or enzyme activities, flow 
cytometric analysis, and the like. Analysis of the blood may also be useful to detect the 
presence of the transgene product in the blood, as well as to evaluate the effect of the 
transgene on the levels of various types of blood cells and other blood constituents. 

Progeny of the transgenic animals may be obtained by mating the transgenic 
animal with a suitable partner, or by in vitro fertilization of eggs and/or sperm obtained 
from the transgenic animal. Where mating with a partner is to be performed, the partner 
may or may not be transgenic and/or a knockout; where it is transgenic, it may contain the 
same or a different transgene, or both. Alternatively, the partner may be a parental line. 
Where in vitro fertilization is used, the fertilized embryo may be implanted into a 
surrogate host or incubated in vitro, or both. Using either method, the progeny may be 
evaluated for the presence of the transgene using methods described above, or other 
appropriate methods. 

The transgenic animals produced in accordance with the present invention will 
include exogenous genetic material. As set out above, the exogenous genetic material will, 
in certain embodiments, be a DNA sequence which results in the production of a Mowgli 
GPCR receptor. Further, in such embodiments the sequence will be attached to a 
transcriptional control element, e.g., a promoter, which preferably allows the expression of 
the transgene product in a specific type of cell. 

Retroviral infection can also be used to introduce transgene into a non-human 
animal. The developing non-human embryo can be cultured in vitro to the blastocyst stage. 
During this time, the blastomeres can be targets for retroviral infection (Jaenich, R. (1976) 
PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic 
treatment to remove the zona pellucida (Manipulating the Mouse Embryo, Hogan eds. 
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector 
system used to introduce the transgene is typically a replication-defective retrovirus 
carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der Putten et al. 
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(1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained by culturing 
the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra; Stewart et 
al. (1987) EMBO J. 6:383-388). Alternatively, infection can be performed at a later stage. 
Virus or virus-producing cells can be injected into the blastocoele (Jahner et al. (1982) 
5 Nature 298:623-628). Most of the founders will be mosaic for the transgene since 

incorporation occurs only in a subset of the cells which formed the transgenic non-human 
animal. Further, the founder may contain various retroviral insertions of the transgene at 
different positions in the genome which generally will segregate in the offspring. In 
addition, it is also possible to introduce transgenes into the germ line by intrauterine 
^ 1 0 retroviral infection of the midgestation embryo (Jahner et al. (1 982) supra). 

A third type of target cell for transgene introduction is the embryonal stem cell 
(ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255- 
258; Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 
1 5 322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA 
transfection or by retrovirus-mediated transduction. Such transformed ES cells can 
thereafter be combined with blastocysts from a non-human animal. The ES cells thereafter 
colonize the embryo and contribute to the germ line of the resulting chimeric animal. For 
^ review see Jaenisch, R. (1988) Science 240: 1468-1474. 

20 We also provide non-human transgenic animals, where the transgenic animal is 

characterized by having an altered Mowgli gene, preferably as described above, as models 
for Mowgli receptor function. Alterations to the gene include deletions or other loss of 
function mutations, introduction of an exogenous gene having a nucleotide sequence with 
targeted or random mutations, introduction of an exogenous gene from another species, or 

25 a combination thereof. The transgenic animals may be either homozygous or heterozygous 
for the alteration. The animals and cells derived therefrom are usefiil for screening 
biologically active agents that may modulate Mowglireceptor function. The screening 
methods are of particular use for determining the specificity and action of potential 
therapies for infections such as bacterial, fungal, protozoan and viral infections, 

30 particularly infections caused by HIV-1 or HIV-2; pain; cancers; diabetes, obesity; 
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anorexia; bulimia; asthma; Parkinson's disease; thrombosis; acute heart failure; 
hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett's 
syndrome. Furthermore, the screening methods are useful for determining the specificity 
and action of therapies for diseases or syndromes such as DP receptor defect, autosomal 
recessive deafness, myotonic dystrophy, essential hypertension. Usher syndrome, in 
particular type 3 Usher syndrome, juvenile-onset cataract, congenital cataract, and 3- 
methylcrotonylglycinuria L 

The animals are useful as a model to investigate the role of Mowgli receptors in 
normal brain, heart, spleen and liver function. 

Another aspect of the invention pertains to a transgenic nonhuman animal having a 
functionally disrupted endogenous Mowgli gene but which also carries in its genome, and 
expresses, a transgene encoding a heterologous Mowgli protein (i.e., a Mowgli from 
another species). Preferably, the animal is a mouse and the heterologous Mowgli is a 
human Mowgli. An animal, or cell lines derived from such an animal of the invention, 
which has been reconstituted with human Mowgli, can be used to identify agents that 
inhibit human Mowgli in vivo and in vitro. For example, a stimulus that induces signalling 
through human Mowgli can be administered to the animal, or cell line, in the presence and 
absence of an agent to be tested and the response in the animal, or cell line, can be 
measvu-ed. An agent that inhibits human Mowgli in vivo or in vitro can be identified based 
upon a decreased response in the presence of the agent compared to the response in the 
absence of the agent. 

The present invention also provides for a Mowgli GPCR deficient transgenic non- 
human animal (a "Mowgli GPCR knock-out"). Such an animal is one which expresses 
lowered or no Mowgli GPCR activity, preferably as a result of an endogenous Mowgli 
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GPCR genomic sequence being disrupted or deleted. The endogenous Mowgli GPCR 
genomic sequence may be replaced by a null allele, which may comprise non- functional 
portions of the wild-type Mowgli sequence. For example, the endogenous Mowgli GPCR 
genomic sequence may be replaced by an allele of Mowgli comprising a disrupting 
5 sequence which may comprise heterologous sequences, for example, reporter sequences 
and/pr selectable markers. Preferably, the endogenous Mowgli GPCR genomic sequence 
in a Mowgli knock-out mouse is replaced by an allele of Mowgli in which one or more, 
preferably all, of the transmembrane sequences is replaced by such a disrupting sequence, 
preferably a lacZ sequence and a neomycin resistance sequence. Preferably, the genomic 
10 Mowgli sequence which is functionally disrupted comprises a mouse Mowgli genomic 
sequence shown in SEQ ID NO: 10. 

Preferably, such an animal expresses no GPCR activity. More preferably, the 
animal expresses no activity of the MowgU GPCR shown as SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 6 or SEQ ID NO: 8. Mowgli GPCR knock-outs may be generated by 
15 various means known in the art, as described in further detail below, 

A specific description of the construction of a Mowgli knock-out mouse is 
disclosed in Examples 1 et seq. 

The present invention also pertains to a nucleic acid construct for functionally 
disrupting a Mowgli gene in a host cell. The nucleic acid construct comprises: a) a non- 
20 homologous replacement portion; b) a first homology region located upstream of the non- 
homologous replacement portion, the first homology region having a nucleotide sequence 
with substantial identity to a first Mowgli gene sequence; and c) a second homology 
region located downstream of the non-homologous replacement portion, the second 
homology region having a nucleotide sequence with substantial identity to a second 
25 MowgU gene sequence, the second Mowgli gene sequence having a location downstream 
of the first Mowgli gene sequence in a naturally occurring endogenous Mowgli gene. 
• Additionally, the first and second homology regions are of sufficient length for 
homologous recombination between the nucleic acid construct and an endogenous Mowgli 
gene in a host cell when the nucleic acid molecule is introduced into the host cell. In a 
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preferred embodiment, the non-homologous replacement portion comprises an expression 
reporter, preferably including lacZ and a positive selection expression cassette, preferably 
including a neomycin phosphotransferase gene operatively linked to a regulatory 
*elenient(s). 

5 Preferably, the first and second MoAvgli gene sequences are derived from SEQ ID 

NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a 
homologue, variant or derivative thereof. 

Another aspect of the invention pertains to recombinant vectors into which the 
nucleic acid construct of the invention has been incorporated. Yet another aspect of the 

10 invention pertains to host cells into which the nucleic acid construct of the invention has 
been introduced to thereby allow homologous recombination between the nucleic acid 
construct and an endogenous Mowgli gene of the host cell, resulting in functional 
disruption of the endogenous Mowgli gene. The host cell can be a mammalian cell that 
normally expresses Mowgli from the liver, brain, spleen or heart, or a pluripotent cell, 

15 such as a mouse embryonic stem cell. Further development of an embryonic stem cell into 
which the nucleic acid construct has been introduced and homologously recombined v^th 
the endogenous Mowgli gene produces a transgenic nonhuman animal having cells that 
are descendant from the embryonic stem cell and thus carry the Mowgli gene disruption in 
their genome. Animals that carry the Mowgli gene disruption in their germline can then be 

20 selected and bred to produce animals having the Mowgli gene disruption in all somatic 
and germ cells. Such mice can then be bred to homozygosity for the Mowgli gene 
disruption. 

A Mowgli GPCR deficient transgenic animal may be generated as follows: 
Construction of MOWGLI Gene Targeting Vector 



25 



Murine MOWGLI genomic clones are isolated from a mouse large insert PAC 
library obtained from HGMP (Hinxton, UK) using a probe sequence amplified from a part 
of the predicted murine open reading frame cDNA sequence (SEQ ID NO: 4), using 
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Standard techniques. The isolated murine MOWGLI genomic clones are then restriction 
mapped in the region of the MOWGLI gene using small oligonucleotide probes and 
standard techniques. 

The murine genomic locus is partially sequenced to enable the design of 
homologous arms to clone into the targeting vector. Two regions of DNA, typically 
between 1 and 5kb in size, from either side of the region of the open reading frame to be 
deleted, called the 5' and 3' homology arms, are amplified by PGR and the fragments are 
cloned into the targeting vector. The position of these arms is chosen so that a homologous 
recombination event will functionally disrupt the MOWGLI gene by deleting at least the 
seven trans-membrane spanning regions. A targeting vector is prepared where the deleted 
MOWGLI sequence is replaced with non-homologous sequences composed of an 
endogenous gene expression reporter (a frame-independent lacZ gene) upstream of a 
selection cassette composed of a promoted neomycin phosphotransferase (neo) gene, 
arranged in the same orientation as the MOWGLI gene. 

The mouse genomic region used in the knockout strategy for the Mowgli gene is 
depicted as SEQ ID NO: 10 

Transfection and Analysis of Embryonal Stem Cells 




Embryonal stem cells (Evans and Kaufman, 1981, Establishment in culture of 
pluripotential cells from mouse embryos. Nature. 1981 Jul 9;292(58 19): 154-6) are 

20 cultured on a neomycin resistant embryonal fibroblast feeder layer grown in Dulbecco's 
Modified Eagles medium supplemented with 20% Fetal Calf Serum, 10% new-bom calf 
serum, 2 mM glutamine, non-essential amino acids, 100 |aM 2-mercaptoethanol and 500 
u/ml leukemia inhibitory factor. Medium is changed daily and ES cells are subcultured 
every three days. 5x10^ ES cells are transfected with 5 \ig of linearized plasmid by 

25 electroporation (25 \xF capacitance and 400 Volts). 24 hours following electroporation the 
transfected cells are cultured for 9 days in medium containing 200 ^xg/ml neomycin. 
Clones are picked into 96 well plates, repUcated and expanded before being screened by 
PGR to identify clones in which homologous recombination had occurred between the 
endogenous Mowgli gene and the targeting construct. Positive clones are typically 
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identified at a rate of 1 to 5%.. These clones were expanded to allow replicas to be frozen 
and sufficient high quality DNA to be prepared for Southern blot confirmation of the 
targeting event using external 5' and 3' probes, all using standard procedures (Russ et al. 
Nature 2000 Mar 2;404(6773):95-9). 

Generation ofMowgli GPCR Deficient Mice 

C57BL/6 female and male mice are mated and blastocysts are isolated at 3.5 days 
of gestation. 10-12 cells from a chosen clone are injected per blastocyst and 7-8 
blastocysts are implanted in the uterus of a pseudopregnant Fl female. A litter of chimeric 
pups are bom containing several high level (up to 100%) agouti males (the agouti coat 
colour indicates the contribution of cells descendent from the targeted clone). The male 
chimeras are mated with feniale and MFl and 129 mice, and germline transmission is 
determined by the agouti coat colour and by PGR genotyping respectively. 

AiNTIBODIES 

The present invention further provides for antibodies which bind to a Mowgli 
polypeptide, fragment, homologue, variant or derivative thereof Particularly Mowgli 
expression, and in particular in diagnosing a Mowgli GPCR associated disease. Other 
preferred antibodies include those which have therapeutic activity, i.e., which are may be 
used in a therapeutic manner to treat, manage or prevent any Mowgh GPCR associated 
disease. 

For the purposes of this invention, the term "antibody", unless specified to the 
contrary, includes but is not limited to, polyclonal, monoclonal, chimeric, single chain. Fab 
fragments and fragments produced by a Fab expression Ubrary. Such fragments include 
fragments of whole antibodies which retain their binding activity for a target substance, Fv, 
F(ab') and F(ab')2 fragments, as well as single chain antibodies (scFv), fusion proteins and 
other synthetic proteins which comprise the antigen-binding site of the antibody. The 
antibodies and fragments thereof may be humanised antibodies, for example as described in 
EP-A-239400. Furthermore, antibodies with fully human variable regions (or their 
fragments), for example, as described in US Patent Nos. 5,545,807 and 6,075,181 may also 
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be used. Neutralizing antibodies, i.e., those which inhibit biological activity of the 
substance amino acid sequences, are especially preferred for diagnostics and therapeutics. 

Antibodies may be produced by standard techniques, such as by immunisation or 
by using a phage display library. 

A polypeptide or peptide of the present invention may be used to develop an 
antibody by known techniques. Such an antibody may be capable of binding specifically 
to the Mowgli GPCR protein or homologue, fragment, etc. 

If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit goat, 
horse, etc.) may be immunised with an immunogenic composition comprising a 
polypeptide or peptide of the present invention. Depending on the host species, various 
adjuvants may be used to increase immunological response. Such adjuvants include, but 
are not limited to, Freund's, mineral gels such as aluminium hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. BCG (Bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants which may be employed 
if purified the substance amino acid sequence is administered to immunologically 
compromised individuals for the purpose of stimulating systemic defence. 

Serum from the immunised animal is collected and treated according to known 
procedures. If serum containing polyclonal antibodies to an epitope obtainable from a 
polypeptide of the present invention contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaffinity chromatography. Techniques for producing 
and processing polyclonal antisera are known in the art. In order that such antibodies may 
be made, the invention also provides amino acid sequences of the invention or firagments 
thereof haptenised to another amino acid sequence for use as immxmogens in animals or 
humans. 

Monoclonal antibodies directed against epitopes obtainable firom a polypeptide or 
peptide of the present invention can also be readily produced by one skilled in the art. The 
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general methodology for making monoclonal antibodies by hybridomas is well known. 
Immortal antibody-producing cell lines can be created by cell fusion, and also by other 
techniques such as direct transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. Panels of monoclonal antibodies produced against 
orbit epitopes can be screened for various properties; i.e., for isotype and epitope affinity. 

Monoclonal antibodies may be prepared using any technique which provides for 
the production of antibody molecules by continuous ceil lines in culture. These include, 
but are not limited to, the hybridoma technique originally described by Koehler and 
Milstein (1975 Nature 256:495-497), the trioma technique, the human B-cell hybridoma 
technique (Kosbor et al ("1983) hnmunol Today 4:72; Cote et al (1983) Proc Natl Acad Sci 
80:2026-2030) and the EBV-hybridoma technique (Cole et al.. Monoclonal Antibodies 
and Cancer Therapy, pp. 77-96, Alan R. Liss, Inc., 1985). 

In addition, techniques developed for the production of "chimeric antibodies", the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule v^th 
appropriate antigen specificity and biological activity can be used (Morrison et al (1984) 
Proc Natl Acad Sci 81:6851-6855; Neuberger et al (1984) Nature 312:604-608; Takeda et 
al (1985) Nature 314:452-454). Alternatively, techniques described for the production of 
single chain antibodies (US Patent No. 4,946,779) can be adapted to produce the substance 
specific single chain antibodies. 

Antibodies, both monoclonal and polyclonal, which are directed against epitopes 
obtainable firom a polypeptide or peptide of the present invention are particularly useful in 
diagnosis, and those which are neutralising are useful in passive immunotherapy. 
Monoclonal antibodies, in particular, may be used to raise anti-idiotype antibodies. Anti- 
idiotype antibodies are immxmoglobulins which carry an "internal image" of the substance 
and/or agent against which protection is desired. Techniques for raising anti-idiotype 
antibodies are known in the art. These anti-idiotype antibodies may also be useful in 
therapy. 
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Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in Orlandi et al (1989, Proc Natl Acad Sci 86: 3833- 
3837), and Winter G and Milstein C (1991; Nature 349:293-299). 

Antibody fragments which contain specific binding sites for the polypeptide or 
peptide may also be generated. For example, such fragments include, but are not limited 

to, the F(ab')2 fragments which can be produced by pepsin digestion of the antibody 
molecule and the Fab fragments which can be generated by reducing the disulfide bridges 
of the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to 
allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse WD et al (1989) Science 256:1275-128 1). 

Techniques for the production of single chain antibodies (U.S. Pat. No. 4,946,778) 
can also be adapted to produce single chain antibodies to polypeptides of this invention. 
Also, transgenic mice, or other organisms including other mammals, may be used to 
express humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptide or to purify the polypeptides by affinity chromatography. 

Antibodies against Mowgli GPCR polypeptides may also be employed to treat 
infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
caused by HIV-1 or HIV-2; pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; 
Parkinson's disease; thrombosis; acute heart failure; hypotension; hypertension; erectile 
dysfunction; urinary retention; metabolic bone diseases such as osteoporisis and osteo 
petrosis; angina pectoris; myocardial infarction; ulcers; asthma; allergies; rheumatoid 
arthritis; inflammatory bowel disease; irritable bowel syndrome benign prostatic 
hypertrophy; and psychotic and neurological disorders, including anxiety, schizophrenia, 
nianic depression, delirium, dementia, severe mental retardation and dyskinesias, such as 
Huntington's disease or Gilles dela Toufett's syndrome, as well as platelet ADP receptor 
defect, autosomal recessive deafness, myotonic dystrophy, essential hypertension. Usher 
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syndrome, in particular type 3 Usher syndrome, juvenile-onset cataract, congenital 
cataract, and 3-methylcrotonylglyciiiuria I. 

Diagnostic Assays 

This invention also relates to the use of Mowgli GPCR polynucleotides and 
polypeptides (as well as homologues, variants and derivatives thereof) for use in diagnosis 
as diagnostic reagents or in genetic analysis. Nucleic acids complementary to or capable of 
hybridising to Mowgli GPCR nucleic acids (including homologues, variants and 
derivatives), as well as antibodies against Mowgli polypeptides are also useful in such 
assays. 

We provide for a nattiral variant of Mowgli polypeptide or nucleic acid, and the 
use of such a natural variant in diagnosis of Mowgli associated disease. Mowgli 
polymorphisms may include differences at the nucleic acid level, which may or may not 
reflect differences in the amino acid level. Preferably, such Mowgli variants or mutants are 

such that they include changes in the amino acid level. However, the invention also 
encompasses Mowgli polymorphisms which occur in non-coding regions, for example, 
expression control regions such as promoters and enhancers. 

Polymorphisms in Mowgli include deletions of one or more nucleic acids, 
insertions of one or more nucleic acids, inversions, etc. Preferably, Mowgli 
polymorphisms comprise single nucleotide polymorphisms. 

Polymorphisms in Mowgli may be identified by comparing sequences at the 
appropriate level (whether nucleic acid or protein) between individuals in a population. 
Differences in sequences may be reflected in different physical properties, and techniques 
for detecting these may rely on detection of changes in physical properties. For example, 
single nucleotide polymorphisms may be detected as restriction fragment length 
polymorphisms (i.e., difference in susceptiability to digestion by a restriction enzyme). 
Furthermore, SNPs may affect the migration or mobility of a nucleic acid fragment or 
protein fragment in a gel. 
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Non-coding polymorphisms in Mowgli may be identified by sequencing non- 
coding regions of Mowgli. For example, control regions of the Mowgli gene, such as 
enhancers and promoters may be sequenced to identify polymorphisms. The effect of such 
non-coding polymorphisms on the expression level of Mowgli may be determined by 

constructing transgenic mice (as described below) comprising the mutant Mowgli 
sequences, or by generating expression constructs and transfection into cell lines. In each 
case, the expression level of Mowgli is detected, by RT-PCR or antibody Western 
staining, to determine the effect of the mutation in the control of expression of Mowgli. 
Useful Mowgli polymorphisms are those which modulate the level of expression, wiether 
by up-regulation or down-regulation of Mowgli levels. 

Accordingly, this invention provides for a variant or mutant or polymorphism in a 
non-coding region of Mowgli, preferably in a control region of Mowgli, preferably in a 
promoter and/or enhancer of Mowgli, which is capable of modulating the level of 
expression of Mowgli in an organism. The invention also provides for a set of two or more 

of such mutants or variants or polymorphisms, preferably non-coding polymorphisms. The 
invention also provides for the use of such variants or polymorphisms or sets of variants to 
identify nucleic acid and/or amino acid positions, in which changes to such positions 
affect the level of expression of Mowgli. The invention also provides for a transgenic 
animal comprising, a variant or mutant or polymorphism of Mowgli, preferably, a non- 
coding polymorphism. 

Detection of a mutated form of the Mowgli GPCR gene associated with a 
dysfunction will provide a diagnostic tool that can add to or define a diagnosis of a disease 
or susceptibility to a disease which results from under-expression, over-expression or 
altered expression of Mowgli GPCR. Individuals carrying mutations in the Mowgli GPCR 
gene (including control sequences) may be detected at the DNA level by a variety of 
techniques. 

For example, DNA may be isolated from a patient and the DNA polymorphism 
pattern of Mowgli determined. The identified pattem is compared to controls of patients 
known to be suffering from a disease associated with over-, under- or abnormal expression 
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of Mowgli. Patients expressing a genetic polymorphism pattern associated with Mowgli 
associated disease may then be identified. Genetic analysis of the Mowgli GPCR gene 
may be conducted by any technique known in the art. For example, individuals may be 
screened by determining DNA sequence of a Mowgli allele, by RPLP or SNP analysis, 
etc. Patients may be identified as having a genetic predisposition for a disease associated 
with the over-, under-, or abnormal expression of Mowgli by detecting the presence of a 
DNA polymorphism in the gene sequence for Mowgli or any sequence controlling its 
expression. 

Patients so identified can then be treated to prevent the occurrence of Mowgli 
associated disease, or more aggressively in the early stages of Mowgli associated disease 
to prevent the further occurrence or development of the disease. Mowgli associated 
diseases include infections such as. bacterial, fungal, protozoan and viral infections, 
particularly infections caused by HIV-l or HIV-2; pain; cancers; diabetes, obesity; 
anorexia; bulimia; asthma; Parkinson's disease; thrombosis; acute heart failure; 
hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such £is Himtington's disease or Gilles dela Tourett's 
syndrome, and also platelet ADP receptor defect, autosomal recessive deafness, myotonic 
dystrophy, essential hypertension. Usher syndrome, in particular type 3 Usher syndrome, 
juvenile-onset cataract, congenital cataract, and 3-methylcrotonylglycinuria I. 

The present invention further discloses a kit for the identification of a patient's 
genetic polymorphism pattern associated with Mowgli associated disease. The kit includes 
DNA sample collecting means and means for determining a genetic polymorphism 
pattern, which is then compared to control samples to determine a patient's susceptibility 
to Mowgli associated disease. Kits for diagnosis of a Mowgli associated disease 
comprising Mowgli polypeptide and/or an antibody against such a polypeptide (or 
fragment of it) are also provided. 
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Nucleic acids for diagnosis may be obtained from a subject's ceils, such as from 
blood, urine, saliva, tissue biopsy or autopsy material. In a preferred embodiment, the 
DNA is obtained from blood cells obtained from a finger prick of the patient with the 
blood collected on absorbent paper. In a further preferred embodiment, the blood is 
collected on an AmpliCard.TM. (University of Sheffield, Department of Medicine and 
Pharmacology, Royal Hallamshire Hospital, Sheffield, England SIO 2JF). 

The DNA may be used directly for detection or may be amplified enzymatically by 
using PGR or other amplification techniques prior to analysis. Oligonucleotide DNA 
primers that target the specific polymorphic DNA region within the genes af interest may 
be prepared so that in the PGR reaction amplification of the target sequences is achieved. 
RNA or cDNA may also be used as templates in similar fashion. The amplified DNA 
sequences from the template DNA may then be analyzed using restriction enzymes to 
determine the genetic polymorphisms present in the amplified sequences and thereby 
provide a genetic polymorphism profile of the patient. Restriction fragments lengths may 
be identified by gel analysis. Alternatively, or in conjunction, techniques such as SNP 
(single nucleotide polymorphisms) analysis may be employed. 

Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled Mowgli GPGR nucleotide sequences. Perfectly 
matched sequences can be distinguished from mismatched duplexes by RNase digestion or 
by dijBferences in melting temperatures. DNA sequence differences may also be detected 
by alterations in electrophoretic mobility of DNA fragments in gels, with or without 
denaturing agents, or by direct DNA sequencing. See, eg., Myers et al. Science 
(1985)230:1242. Sequence changes at specific locations may also be revealed by nuclease 
protection assays, such as RNase and SI protection or the chemical cleavage method. See 
Gotton et al., Proc Natl Acad Sci USA (1985) 85: 4397-4401. In another embodiment, an 
array of oligonucleotides probes comprising the Mowgli GPGR nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic 
mutations. Array technology methods are well known and have general applicability and 
can be used to address a variety of questions in molecular genetics including gene 
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expression, genetic linkage, and genetic variability. (See for example: M.Chee et al,, 
Science, Vol 274, pp 610-613 (1996)). 

Single strand conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant and wild type nucleic acids (Orita 
et al. (1989) Proc Natl Acad Sci USA: 86:2766, see also Cotton (1993) Mutat Res 
285:125-144; and Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA 
fragments of sample and control Mowgli nucleic acids may be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of even 
a single base change. The DNA fragments may be labelled or detected with labelled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in 
which the secondary structure is more sensitive to a change in sequence. In a preferred 
embodiment the subject method utilizes heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al. 
{\99\) Trends Genet 1\5). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility 
to Mowgli associated diseases, for example, infections such as bacterial, fungal, protozoan 
and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; 
diabetes, obesity; anorexia; bulimia; asthma; Parkinson's disease; thrombosis; acute heart 
failure; hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardi£d infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome 
through detection of mutation in the Mowgli GPCR gene by the methods described. In 
particular, the diagnostic methods are useful to determine susceptibility to platelet ADP 
receptor defect, autosomal recessive deafness, myotonic dystrophy, essential hypertension. 
Usher syndrome, in particular type 3 Usher syndrome, juvenile-onset cataract, congenital 
cataract, and 3-methylcrotonylglycinuria 1. 
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The presence of Mowgli GPCR polypeptides and nucleic acids may be detected in 
a sample. Thus, infections and diseases as listed above can be diagnosed by methods 
comprising detemiining from a sample derived from a subject an abnomially decreased or 
increased level of the Mowgli GPCR polypeptide or Mowgli GPCR mRNA. The sample 
may comprise a cell or tissue sample from an organism suffering or suspected to be 
suffering from a disease associated with increased, reduced or otherwise abnormal Mowgli 
GPCR expression, including spatial or temporal changes in level or pattern of expression. 
The level or pattern of expression of Mov/gli in an organism suffering from or suspected 
to be suffering from such a disease may be usefully compared with the level or partem of 
expression in a normal organism as a means of diagnosis of disease. 

In general therefore, the invention includes a method of detecting the presence of a 
nucleic acid comprising a Mowgli GPCR nucleic acid in a sample, by contacting the 
sample with at least one nucleic acid probe which is specific for said nucleic acid and 
monitoring said sample for the presence of the nucleic acid. For example, the nucleic acid 
probe may specifically bind to the Mowgli GPCR nucleic acid, or a portion of it, and 
binding between the two detected; the presence of the complex itself may also be detected. 
Furthermore, the invention encompasses a method of detecting the presence of a Mowgli 
GPCR polypeptide by contacting a cell sample with an antibody capable of binding the 
polypeptide and monitoring said sample for the presence of the polypeptide. This may 
conveniently be achieved by monitoring the presence of a complex formed between the 
antibody and the polypeptide, or monitoring the binding between the polypeptide and the 
antibody. Methods of detecting binding between two entities are known in the art, and 
include FRET (fluorescence resonance energy transfer), surface plasmon resonance, etc. 

Decreased or increased expression can be measured at the RNA level using any of 
the methods well known in the art for the quantitation of polynucleotides, such as, for 
example, PCR, RT-PCR, RNase protection, Northem blotting and other hybridization 
methods. Assay techniques that can be used to determine levels of a protein, such as a 
Mowgli GPCR, in a sample derived from a host are well-known to those of skill in the art. 
Such assay methods include radioimmunoassays, competitive-binding assays, Westem 
Blot analysis and ELISA assays. 
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The present invention relates to a diagnostic kit for a disease or susceptibility to 
aMowgli associated disease (including an infection), for example, infections such as 
bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1 
or HIV-2; pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; Parkinson's disease; 
thrombosis; acute heart failure; hypotension; hypertension; erectile dysfunction; urinary 
retention; metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 
disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
neurological disorders, including anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles dela Tourett's syndrome, as well as platelet ADP receptor defect, autosomal 
recessive deafness, myotonic dystrophy, essential hypertension. Usher syndrome, in 
particular type 3 Usher syndrome, juvenile-onset cataract, congenital cataract, and 3- 
methylcrotonylglycinuria I. The diagnostic kit comprises a Mowgli GPCR polynucleotide 
or a fragment thereof; a complementary nucleotide sequence; a Mowgli GPCR 
polypeptide or a fragment thereof, or an antibody to a Mowgli GPCR polypeptide. 

Chromosome Assays 

The nucleotide sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can hybridize with 
a particular location on an individual human chromosome. As described above, human 
Mowgli GPCR is found to map to Homo sapiens chromosome 3q21-q25. 

The mapping of relevant sequences to chromosomes according to the present 
invention is cui important first step in correlating those sequences with gene associated 
disease. Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. Such data are found, for example, in V. McKusick, Mendelian heritance in Man 
(available on line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and diseases that have been mapped to the same chromosomal 
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region are then identified through linkage analysis (coinheritance of physically adjacent 
genes). 

The differences in the cDNA or genomic sequence between affected and 
unaffected individuals can also be determined. If a mutation is observed in some or all of 
the affected individuals but not in any normal individuals, then the mutation is likely to be 
the causative agent of the disease. 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating an abnormal conditions related to both 
an excess of and insufficient amounts of Mowgli GPCR activity. 

If the activity of Mowgli GPCR is in excess, several approaches are available. One 
approach comprises administering to a subject an inhibitor compound (antagonist) as 
hereinabove described along with a pharmaceutically acceptable carrier in an amount 
effective to inhibit activation by blocking binding of ligands to the Mowgli GPCR, or by 
inhibiting a second signal, and thereby alleviating the abnormal condition. 

In another approach, soluble forms of Mowgli GPCR polypeptides still capable of 
binding the ligand in competition with endogenous Mowgli GPCR may be administered. 
Typical embodiments of such competitors comprise fragments of the Mowgli GPCR 
polypeptide. 

In still another approach, expression of the gene encoding endogenous Mowgli 
GPCR can be inhibited using expression blocking techniques. Known such techniques 
involve the use of antisense sequences, either internally generated or separately 
administered. See, for example, O'Connor, JNeurochem (1991) 56:560 in 
Oligodeoxvnucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, Fla. (1988). Alternatively, oligonucleotides which form triple helices with the gene 
can be supplied. See, for example, Lee et al.. Nucleic Acids Res (1979) 6:3073; Cooney et 
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al., Science (1988) 241:456; Dervan et al., Science (1991) 251:1360. These oligomers can 
be administered per se or the relevant oligomers can be expressed in vivo. 

For treating abnormal conditions related to an under-expression of Mowgli GPCR 
and its activity, several approaches are also available. One approach comprises 
administering to a subject a therapeutically effective amount of a compound which 
activates Mowgli GPCR, i.e., an agonist as described above, in combination with a 
pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. 
Alternatively, gene therapy may be employed to effect the endogenous production of 
Mowgli GPCR by the relevant cells in the subject. For example, a polynucleotide of the 
invention may be engineered for expression in a replication defective retroviral vector, as 
discussed above. The retroviral expression construct may then be isolated and introduced 
into a packaging cell transduced with a retroviral plasmid vector containing RNA 
encoding a polypeptide of the present invention such that the packaging cell now produces 
infectious viral particles containing the gene of interest. These producer cells may be 
administered to a subject for engineering cells in vivo and expression of the polypeptide in 
vivo. For overview of gene therapy, see Chapter 20, Gene Therapy and other Molecular 
Genetic-based Therapeutic Approaches, (and references cited therein) in Human 
Molecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). 

Formulation and Administration 

Peptides, such as the soluble form of Mowgli GPCR polypeptides, and agonists 
and antagonist peptides or small molecules, may be formulated in combination with a 
suitable pharmaceutical carrier. Such formulations comprise a therapeutically effective 
amount of the polypeptide or compound, and a pharmaceutically acceptable carrier or 
excipient. Such carriers include but are not limited to, saline, buffered saline, dextrose, 
water, glycerol, ethanol, and combinations thereof. Formulation should suit the mode of 
administration, and is well within the skill of the art. The invention further relates to 
pharmaceutical packs and kits comprising one or more containers filled with one or more 
of the ingredients of the aforementioned compositions of the invention. 
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Polypeptides and other compounds of the present invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. 

Preferred forms of systemic administration of the pharmaceutical compositions 
include injection, typically by intravenous injection. Other injection routes, such as 
subcutaneous, intramuscular, or intraperitoneal, can be used. Alternative means for 
systemic administration include transmucosal and transdermal administration using 
penetrants such as bile salts or fiisidic acids or other detergents. In addition, if properly 
formulated in enteric or encapsulated formulations, oral administration may also be 
possible. Administration of these compounds may also be topical and/or localize, in the 
form of salves, pastes, gels and the like. 

The dosage range required depends on the choice of peptide, the route of 
administration, the nature of the formulation, the nature of the subject's condition, and the 
judgment of the attending practitioner. Suitable dosages, however, are in the range of 0.1- 
100 fig/kg of subject. Wide variations in the needed dosage, however, are to be expected 
in view of the variety of compounds available and the differing efficiencies of various 
routes of administration. For example, oral administration would be expected to require 
higher dosages than administration by intravenous injection. Variations in these dosage 
levels can be adjusted using standard empirical routines for optimization, as is well 
understood in the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, 
in treatment modalities often referred to as "gene therapy" as described above. Thus, for 
example, cells from a subject may be engineered with a polynucleotide, such as a DNA or 
RNA, to encode a polypeptide ex vivo, and for example, by the use of a retroviral plasmid 
vector. The cells are then introduced into the subject. 

Pharmaceutical Compositions 

The present invention also provides a pharmaceutical composition comprising 
administering a therapeutically effective amount of the polypeptide, polynucleotide. 
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peptide, vector or antibody of the present invention and optionally a pharmaceutically 
acceptable carrier, diluent or excipients (including combinations thereof). 

The pharmaceutical compositions may be for human or animal usage in human and 
veterinary medicine and will typically comprise any one or more of a pharmaceutically 
acceptable diluent, carrier, or excipient. Acceptable carriers or diluents for therapeutic use 
are well known in the pharmaceutical art, and are described, for example, in Remington's 
Pharmaceutical Sciences, Mack Publishing Co. (A. R. Gennaro edit. 1985). The choice of 
pharmaceutical carrier, excipient or diluent can be selected with regard to the intended 
route of administration and standard pharmaceutical practice. The pharmaceutical 
compositions may comprise as - or in addition to - the carrier, excipient or diluent any 
suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). 

Preservatives, stabilizers, dyes and even flavoring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, sorbic 
acid and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents may be also 
used. 

There may be different composition/formulation requirements dependent on the 
different delivery systems. By way of example, the pharmaceutical composition of the 
present invention may be formulated to be delivered using a mini-pump or by a mucosal 
route, for example, as a nasal spray or aerosol for inhalation or ingestible solution, or 
parenterally in which the composition is formulated by an injectable form, for deUvery, 
by, for example, an intravenous, intramuscular or subcutaneous route. Alternatively, the 
formulation may be designed to be delivered by both routes. 

Where the agent is to be delivered mucosally through the gastrointestinal mucosa, 
it should be able to remain stable during transit though the gastrointestinal tract; for 
example, it should be resistant to proteolytic degradation, stable at acid pH and resistant to 
the detergent effects of bile. 
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Where appropriate, the pharmaceutical compositions can be administered by 
inhalation, in the form of a suppository or pessary, topically in the form of a lotion, 
solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form of 
tablets containing excipients such as starch or lactose, or in capsules or ovules either alone 
or in admixture with excipients, or in the form of elixirs, solutions or suspensions 
containing flavouring or colouring agents, or they can be injected parenterally, for 
example intravenously, intramuscularly or subcutaneously. For parenteral administration, 
the compositions may be best used in the form of a sterile aqueous solution which may 
contain other substances, for example enough salts or monosaccharides to make the 
solution isotonic with blood. For buccal or sublingual administration the compositions 
may be administered in the form of tablets or lozenges which can be formulated in a 
conventional manner. 

Vaccines 

Another embodiment of the invention relates to a method for inducing an 
immunological response in a mammal which comprises inoculating the mammal with the 
Mowgli GPCR polypeptide, or a fragment thereof, adequate to produce antibody and/or T 
cell immune response to protect said animal from infections such as bacterial, fungal, 
protozoan and viral infections, particularly infections caused by HIV-1 or HIV -2; pain; 
cancers; diabetes, obesity; anorexia; bulimia; asthma; Parkinson's disease; thrombosis; 
acute heart failure; hypotension; hypertension; erectile dysfunction; urinary retention; 
metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 
disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
neurological disorders, including anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles dela Tourett*s syndrome. The induced immune response may also be employed to 
protect the animal from other diseases such as platelet AD? receptor defect, autosomal 
recessive deafness, myotonic dystrophy, essential hypertension, Usher syndrome, in 
particular type 3 Usher syndrome, juvenile-onset cataract, congenital cataract, and 3- 
methylcrotonylglycinuria 1. 
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Yet another embodiment of the invention relates to a method of inducing 
immunological response in a mammal which comprises delivering a Mowgli GPCR 
polypeptide via a vector directing expression of a Mowgli GPCR polynucleotide in vivo in 
order to induce such an immunological response to produce antibody to protect said 
animal from diseases. 

A further embodiment of the invention relates to an inununological/vaccine 
formulation (composition) which, when introduced into a mammalian host, induces an 
immunological response in that mammal to a Mowgli GPCR polypeptide wherein the 
composition comprises a Mowgli GPCR polypeptide or Mowgli GPCR gene. The vaccine 
formulation may further comprise a suitable carrier. 

Since the Mowgli GPCR polypeptide may be broken down in the stomach, it is 
preferably administered parenterally (including subcutaneous, intramuscular, intravenous, 
intradermal etc. injection). Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, 
buffers, bacteriostats and solutes which render the formulation instonic with the blood of 
the recipient; and aqueous and non-aqueous sterile suspensions which may include 
suspending agents or thickening agents. The formulations may be presented in unit-dose or 
multi-dose containers, for example, sealed ampoules and vials and may be stored in a 
freeze-dried condition requiring only the addition of the sterile liquid carrier immediately 
prior to use. The vaccine formulation may also include adjuvant systems for enhancing the 
immunogenicity of the formulation, such as oil-in water systems and other systems known 
in the art. The dosage will depend on the specific activity of the vaccine and can be readily 
determined by routine experimentation. 

Vaccines may be prepared from one or more polypeptides or peptides of the 
present invention. 

The preparation of vaccines which contain an immunogenic polypeptide(s) or 
peptide(s) as active ingredient(s), is known to one skilled in the art. Typically, such 
vaccines are prepared as injectables, either as liquid solutions or suspensions; solid forms 
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suitable for solution in, or suspension in, liquid prior to injection may also be prepared. 
The preparation may also be emulsified, or the protein encapsulated in liposomes. The 
active immunogenic ingredients are often mixed with excipients which are 
pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients 
5 are, for example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof 

In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants 
which enliance the effectiveness of the vaccine. Examples of adjuvants which may be 

1 0 effective include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L- 
threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine 
(CGP 11 637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L- 
alanine-2-( 1 ' -2 ' -dipalmitoy 1-sn-glyceroo -hydroxyphosphoryloxy)-ethylamine (CGP 
19835 A, referred to as MTP-PE), and RIBI, which contains three components extracted 

15 from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. 

Further examples of adjuvants and other agents include aluminum hydroxide, 
aluminum phosphate, aluminum potassium sulfate (alum), beryllium sulfate, silica, kaolin, 
carbon, water-in-oil emulsions, oil-in-water emulsions, muramyl dipeptide, bacterial 

20 endotoxin, lipid X, Corynebacterium parvum {Propionobacterium acnes), Bordetella 

pertussis, polyribonucleotides, sodium alginate, lanolin, lysolecithin, vitamin A, saponin, 
liposomes, levamisole, DEAE-dextran, blocked copolymers or other synthetic adjuvants. 
Such adjuvants are available commercially from various sources, for example, Merck 
Adjuvant 65 (Merck and Company, Inc., Rahway, N J.) or Freund's Incomplete Adjuvant 

25 and Complete Adjuvant (Difco Laboratories, Detroit, Michigan). 



Typically, adjuvants such as Amphigen (oil-in-water), Alhydrogel (aluminimi 
hydroxide), or a mixture of Amphigen and Alhydrogel are used. Only aluminum 
hydroxide is approved for human use. 
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The proportion of immunogen and adjuvant can be varied over a broad range so 
long as both are present in effective amounts. For example, aluminum hydroxide can be 
present in an amount of about 0.5% of the vaccine mixture (AI2O3 basis). Conveniently, 
the vaccines are formulated to contain a final concentration of immunogen in the range of 
from 0.2 to 200 ^ig/ml, preferably 5 to 50 j-ig/ml, most preferably 15 |ig/ml. 

After formulation, the vaccine may be incorporated into a sterile container which is 
then sealed and stored at a low temperature, for example 4°C, or it may be freeze-dried, 
Lyophilisation permits long-term storage in a stabilised form. 

The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations which are 
suitable for other modes of administration include suppositories and, in some cases, oral 
formulations. For suppositories, traditional binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures 
containing the active ingredient in the range of 0.5% to 10%, preferably 1% to 2%. Oral 
formulations include such normally employed excipients as, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, 
magnesium carbonate, and the like. These compositions take the form of solutions, 
suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 
10% to 95% of active ingredient, preferably 25% to 70%. Where the vaccine composition 
is lyophilised, the lyophilised material may be reconstituted prior to administration, e.g. as 
a suspension. Reconstitution is preferably effected in buffer 

Capsules, tablets and pills for oral administration to a patient may be provided with 
an enteric coating comprising, for example, Eudragit "S", Eudragit "L", cellulose acetate, 
cellulose acetate phthalate or hydroxypropylmethyl cellulose. 

The polypeptides of the invention may be formulated into the vaccine as neutral or 
salt forms. Pharmaceutically acceptable salts include the acid addition salts (formed with 
free amino groups of the peptide) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids such as acetic, oxalic, 
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tartaric and maleic. Salts formed with the free carboxyl groups may also be derived from 
inorganic bases such as, for exeimple, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 
ethanol, histidine and procaine. 

Administration 

Typically, a physician will determine the actual dosage which will be most suitable 
for an individual subject and it will vary with the age, weight and response of the 
particular patient. The dosages below are exemplary of the average case. There can, of 
course, be individual instances where higher or lower dosage ranges are merited. 

The pharmaceutical and vaccine compositions of the present invention may be 
administered by direct injection. The composition may be formulated for parenteral, 
mucosal, intramuscular, intravenous, subcutaneous, intraocular or transdermal 
administration. Typically, each protein may be administered at a dose of from 0.01 to 30 
mg/kg body weight, preferably from 0.1 to 10 mg/kg, more preferably from 0.1 to 1 mg/kg 
body weight. 

The term "administered" includes delivery by viral or non-viral techniques. Viral 
delivery mechanisms include but are not limited to adenoviral vectors, adeno-associated viral 
(AAV) vectos, herpes viral vectors, retroviral vectors, lentiviral vectors, and baculoviral 
vectors. Non- viral delivery mechanisms include lipid mediated transfection, liposomes, 
immunoliposomes, lipofectin, cationic facial amphiphiles (CPAs) and combinations thereof. 
The routes for such delivery mechanisms include but are not limited to mucosal, nasal, 
oral, parenteral, gastrointestinal, topical, or sublingual routes. 

The term "administered" includes but is not limited to delivery by a mucosal route, 
for example, as a nasal spray or aerosol for inhalation or as an ingestable solution; a . 
parenteral route where delivery is by an injectable form, such as, for example, an 
intravenous, intramuscular or subcutaneous route. 
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The term "co-administered" means that the site and time of administration of each 
of for example, the polypeptide of the present invention and an additional entity such as 
adjuvant are such that the necessary modulation of the immune system is achieved. Thus, 
whilst the polypeptide and the adjuvant may be administered at the same moment in time 
and at the same site, there may be advantages in administering the polypeptide at a 
different time and to a different site from the adjuvant. The polypeptide and adjuvant may 
even be delivered in the same delivery vehicle - and the polypeptide and the antigen may 
be coupled and/or uncoupled and/or genetically coupled and/or uncoupled. 

The polypeptide, polynucleotide, peptide, nucleotide, antibody of the invention and 
optionally an adjuvant may be administered separately or co-administered to the host 
■subject as a single dose or in multiple doses. 

The vaccine composition and pharmaceutical compositions of the present invention 
may be administered by a number of different routes such as injection (which includes 
parenteral, subcutaneous and intramuscular injection) intranasal, mucosal, oral, intra- 
vaginal, urethral or ocular administration. • . 

The vaccines and pharmaceutical compositions of the present invention may be 
conventionally administered parenterally, by injection, for example, either subcutaneously 
or intramuscularly. Additional formulations which are suitable for other modes of 
administration include suppositories and, in some cases, oral formulations. For 
suppositories, traditional binders and carriers may include, for example, polyalkylene 
glycols or triglycerides; such suppositories may be formed from mixtures containing the 
active ingredient in the range of 0.5% to 10%, may be 1% to 2%. Oral formulations 
include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 
carbonate, and the like. These compositions take the form of solutions, suspensions, 
tablets, pills, capsules, sustained release formulations or powders and contain 10% to 95% 
of active ingredient, preferably 25% to 70%. Where the vaccine composition is 
lyophilised, the lyophilised material may be reconstituted prior to administration, e.g. as a 
suspension. Reconstitution is preferably effected in buffer. 
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Further Aspects of the Invention 

Further aspects of the invention are now set out in the following numbered 
paragraphs; it is to be understood that the invention encompasses these aspects: 

Paragraph 1. A host cell comprising: (a) a heterologous nucleic acid which 
comprises the nucleic acid sequence shown in SEQ ID NO: 1, SEQ ID N0:2 or SEQ ID 
..NO: 4, or a fragment, homologue, variant or derivative thereof; or (b) a heterologous 
nucleic acid which encodes a Mowgli GPCR polypeptide comprising the amino acid 
sequence shown in SEQ ID NO: 3 or SEQ ID NO: 5, or a fragment, homologue, variant or 
derivative thereof; (c) a heterologous nucleic acid which encodes a polypeptide which 
comprises one or more regions which are homologous between SEQ ID NO: 3 and SEQ 
ID NO: 5, or which comprises one or more regions which are heterologous between SEQ 
ID NO: 3 and SEQ ID NO: 5; or (d) a vector comprising a nucleic acid according to any of 
(a), (b) or (c) above. 

Paragraph 2. A transgenic non-human animal comprising: (a) a heterologous 
nucleic acid which comprises the nucleic acid sequence shown in SEQ ID NO: 1, SEQ ID 
NO:2 or SEQ ID NO: 4, or a fragment, homologue, variant or derivative thereof; or (b) a 
heterologous nucleic acid which encodes a Mowgli GPCR polypeptide comprising the 
amino acid sequence shown in SEQ ID NO: 3 or SEQ ID NO: 5, or a fragment, 
homologue, variant or derivative thereof; (c) a heterologous nucleic acid which encodes a 
polypeptide which comprises one or more regions which are homologous between SEQ ID 
NO: 3 and SEQ ID NO: 5, or which comprises one or more regions which are 
heterologous between SEQ ID NO: 3 and SEQ ID NO: 5; (d) a vector comprising a 
nucleic acid according to any of (a), (b) or (c) above; or (e) a host cell according to 
Paragraph 1. 

Paragraph 3. A transgenic non-human animal according to Paragraph 2 which is 
a mouse. 
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Paragraph 4. Use of a transgenic non-human animal according to Paragraph 2 or 
3 in a method of identifying a compound which is capable of interacting specifically with 
a G protein coupled receptor. 

Paragraph 5. Use of a Mowgli polypeptide in a method of identifying a 
compound which is capable of interacting specifically with a G protein coupled receptor. 

Paragraph 6, A method for identifying an antagonist of a Mowgli GPCR, the 
method comprising contacting a cell which expresses Mowgli receptor with a candidate 
compound and determining whether the level of cyclic AMP (cAMP) in the cell is lowered 
as a result of said contacting. 

Paragraph 7. A method for identifying a compound capable of lowering the 
endogenous level of cyclic AMP in a cell which method comprises contacting a cell which 
expresses a Mowgli GPCR with a candidate compound and determining whether the level 
of cyclic AMP (cAMP) in the cell is lowered as a result of said contacting. 

Paragraph 8. A method of identifying a compound capable of binding to a 
Mowgli GPCR polypeptide, the method comprising contacting a Mowgli GPCR 
polypeptide with a candidate compound and determining whether the candidate compound 
binds to the Mowgli GPCR polypeptide. 

Paragraph 9. A compound identified by a method according to any of Paragraphs 

2 to 7. 

Paragraph 10. A compound capable of binding specifically to a Mowgli 
polypeptide. 

Paragraph 1 1 . Use of a Mowgli polypeptide or part thereof or a Mowgli nucleic 
acid in a method for producing antibodies. 
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Paragraph 12. An antibody capable of binding specifically to a Mowgli 
polypeptide or part thereof or a polypeptide encoded by a Mowgli nucleic acid or part 
thereof 

Paragraph 13. A pharmaceutical composition comprising any one or more of the 
following: a Mowgli polypeptide, or part thereof; a Mowgli nucleic acid or part thereof; a 
vector comprising Mowgli nucleic acid; a cell according to Paragraph 1 ; a compound 
according to Paragraph 9 or 10; and an antibody according to Paragraph 12, together with 
a pharmaceutically acceptable carrier or diluent. 

Paragraph 14. A vaccine composition comprising any one or more of the 
following: a Mowgli polypeptide, or part thereof; a Mowgli nucleic acid or part thereof; a 
vector comprising Mowgli nucleic acid; a cell according to Paragraph 1 ; a compound 
according to Paragraph 9 or 10; and an antibody according to Paragraph 12. 

Paragraph 15. A diagnostic kit for a disease or susceptibility to a disease 
comprising any one or more of the following: a Mowgli polypeptide, or part thereof; a 
Mowgli nucleic acid or part thereof; a vector comprising Mowgli nucleic acid; a cell 
according to Paragraph 1 ; a compound according to Paragraph 9 or 1 0; and an antibody 
according to Paragraph 12. 

Paragraph 16. A method of treating a patient suffering from a disease associated 
with enhanced activity of a Mowgli GPCR, which method comprises administering to the 
patient an antagonist of Mowgli GPCR. 

Paragraph 17. A method of treating a patient suffering from a disease associated 
with reduced activity of a Mowgli GPCR, which method comprises administering to the 
patient an agonist of Mowgli GPCR. 



Paragraph 18. A method for treating and/or preventing a disease in a patient, 
which comprises the step of administering any one or more of the following to the patient: 
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a Mowgli polypeptide, or part thereof; a Mowgli nucleic acid or part thereof; a vector 
comprising Mowgli nucleic acid; a cell according to Paragraph 1 ; a compound according 
to Paragraph 9 or 10; and an antibody according to Paragraph 12; a pharmaceutical 
composition according to Paragraph 13; or a vaccine according to Paragraph 14. 

Paragraph 19. An agent comprising a Mowgli polypeptide, or part thereof; a 
Mowgli nucleic acid or part thereof; a vector comprising Mowgli nucleic acid; a cell 
according to Paragraph 1 ; a compound according to Paragraph 9 or 1 0; and/or an antibody 
according to Paragraph 12, said agent for use in a method of treatment or prophylaxis of 
disease. 

Paragraph 20. Use of a Mowgli polypeptide, or part thereof; a Mowgli nucleic acid 
or part thereof; a vector comprising Mowgli nucleic acid; a cell according to Paragraph 1 ; 
a compound according to Paragraph 9 or 10; and an antibody according to Paragraph 12, 
for the preparation of a pharmaceutical composition for the treatment or prophylaxis of a 
disease. 

Paragraph 2 1 . A non-human transgenic animal, characterised in that the transgenic 
animal comprises an altered Mowgli gene. 

Paragraph 22. A non-human transgenic animal according to Paragraph 21, in 
which the alteration is selected from the group consisting of: a deletion of a Mowgli gene, 
a mutation in a Mowgli gene resulting in loss of function, introduction of an exogenous 
gene having a nucleotide sequence with targeted or random mutations into a Mowgli gene, 
introduction of an exogenous gene from another species into a Mowgli gene, and a 
combination of any of these. 

Paragraph 23, A non-human transgenic animal having a functionally disrupted 
endogenous Mowgli gene, in which the transgenic animal comprises in its genome and 
expresses a transgene encoding a heterologous Mowgli polypeptide. 
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Paragraph 24. A nucleic acid construct for functionally disrupting a Mowgli gene 
in a host cell, the nucleic acid construct comprising: (a) a non-homologous replacement 
portion; (b) a first homology region located upstream of the non-homologous replacement 
portion, the first homology region having a nucleotide sequence with substantial identity 
to a first Mowgli gene sequence; and (c) a second homology region located downstream of 
the non-homologous replacement portion, the second homology region having a 
nucleotide sequence with substantial identity to a second Mowgli gene sequence, the 
second Mowgli gene sequence having a location downstream of the first Mowgli gene 
sequence in a naturally occurring endogenous Mowgli gene. 

Paragraph 25. A process for producing a Mowgli GPCR polypeptide, the method 
comprising culturing a host cell according to Paragraph 1 under conditions in which a 
nucleic acid encoding a Mowgli GPCR polypeptide is expressed. 

Paragraph 26. A method of detecting the presence of a Mowgli nucleic acid in a 
sample, the method comprising contacting the sample with at least one nucleic acid probe 
which is specific for said nucleic acid and monitoring said sample for the presence of the 
nucleic acid. 

Paragraph 27. A method of detecting the presence of a Mowgli polypeptide in a 
sample, the method comprising contacting the sample with an antibody according to 
Paragraph 12 and monitoring said sample for the presence of the polypeptide. 

Paragraph 28. A method of diagnosis of a disease or syndrome caused by or 
associated with increased, decreased or otherwise abnormal expression of Mowgli GPCR, 
the method comprising the steps of: (a) detecting the level or pattern of expression of 
Mowgli GPCR in an animal suffering or suspected to be suffering from such a disease; 
and (b) comparing the level or pattem of expression with that of a normal animal. 

Paragraph 29. A use, method, compound, antibody, pharmaceutical composition, 
vaccine, diagnostic kit, transgenic animal, or process according to any preceding 
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Paragraph, in which the Mowgli polypeptide, receptor or GPCR comprises: (a) a 
polypeptide comprising the amino acid sequence shown in SEQ ID NO: 3 or SEQ ID NO: 
5, or a fragment, homologue, variant or derivative thereof; (b) a polypeptide which 
comprises one or more regions which are homologous between SEQ ID NO: 3 and SEQ 
ID NO: 5, or which comprises one or more regions which are heterologous between SEQ 
ID NO: 3 and SEQ ID NO: 5; or (c) a polypeptide which is encoded by any of SEQ ID 
Nos. 1 , 2 or 4, or which is encoded by a fragment, homologue, variant or derivative 
thereof. 

Paragraph 30. A use, method, compound, antibody, pharmaceutical composition, 
vaccine, diagnostic kit, transgenic animal, or process according to any preceding 
Paragraph, in which the Mowgli gene or nucleic acid comprises: (a) a nucleic acid 
comprising the nucleic acid sequence shown in SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID 
NO: 4, or a fragment, homologue, variant or derivative thereof; or (b) a nucleic acid 
encoding a polypeptide comprising the amino acid sequence shown in SEQ ID NO: 3 or 
SEQ ID NO: 5, or a fragment, homologue, variant or derivative thereof. 

Example 1. Transgenic MOWGLI Knock-Out Mouse 

Construction of Mowgli Gene Targeting Vector 

Murine Mowgli genomic clones are isolated from a mouse large insert PAC library 
obtained from HGMP (Hinxton, UK) using a probe sequence amplified from the predicted 
murine open reading frame cDNA sequence (SEQ ID NO: 4), using standard techniques. 
The primers musMowgli 792F and 1014R are used to generate the probe. 

The isolated murine Mowgli genomic clones are then restriction mapped in the 
region of the Mowgli gene using small oligonucleotide probes and standard techniques. 
The murine genomic locus is partially sequenced to enable the design of homologous arms 
to clone into the targeting vector. The structure of the targeting vector, including the 
relevant restriction sites, is shown in Figure 5. 
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The murine Mowgli gene has two coding exons. The targeting strategy is designed 
to remove the majority of the second 7tm-containing exon. A 3.7 kb 5' homologous arm 
and a 2.5kb 3' homologous arm flanking the 7tm-containing exon are amplified by PGR 
and the fragments are cloned into the targeting vector. The 5' end of each oligonucleotide 
primer used to amplify the arms is synthesized to contain a different recognition site for a 
rare-cutting restriction enzyme, compatible with the cloning sites of the vector polylinkers 
and absent from the arms themselves. In the case of Mowgli, the primers are designed as 
listed in the sequence table below, with 5' arm cloning enzymes of Notl/Spel and 3' arm 
f cloning enzymes of Ascl/Fsel. 



In addition to the arm primer pairs (5'armF.2/5'armR.2 and 3'armF/3'armR.3), 
further primers specific to the Mowgli locus are designed for the following purposes: 5' 
and 3' probe primer pairs (5'prF/5'prR and 3'prF.3/3'prR.3) to amplify two short 200- 
300bp fragments of non-repetitive genomic DNA external to and extending beyond each 
arm, to allow Southern analysis of the targeted locus, in isolated putative targeted clones; a 
mouse genotyping primer pair (hetF and hetR) which allows differentiation between wild- 
type, heterozygote and homozygous mice, when used in a multiplex PGR with a vector 
specific primer, in this case, Asc403; and lastly, a target screening primer (3'scr.3) which 
anneals upstream of the end of the 3' arm region, and which produces a target event 
specific 2.6kb amplimer when paired with a primer specific to the 3' end of the vector 
(neo36). This amplimer can only be derived from template DNA from cells where the 
desired genomic alteration has occurred and allows the identification of correctly targeted 
cells from the background of clones containing randomly integrated copies of the vector. 
The location of these primers and the genomic structure of the MOWGLI locus used in the 
targeting strategy is shown in SEQ ID NO: 10. 

musMowgli 5'prF ACTCCATCTGGTAGGTAGGGCAGTGAC 

musMowQii 5'prR ATACCATGTTGCCTGGTCCAGATAGAC 

musMowgli 5'armF.2 Not AAAGCGGCCGCGTATGTGGCCATCAAGAAACGTGAAC 

musMowgli 5'armR.2 Spe TTTACTAGTTCTTCCCATGTCCCTCACTTGTGCTG 

musMowgli 3'armF Asc aaaggcgcgccAACACCTGCACCTGCCTCCTGGAACTC 

musMowgli 3'armR.3 Fse aaaggccggcCACAGCACACTGATCACAGCATCATC 

musMowgli 3'scr.3 GATAGCACAAGTGCCACCCTGTGATAG 

musMowgli 3'prF.3 ATTAGGGATGGATACCTGCCAAATGTG 

musMowgli 3'prR.3 GTCTTCTCTAGGCACTGTGTTTGAAGC 
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musMowgli hetF 
musMowgti hetR 
musMowgli 792F 
musMowgli 1014R 
Asc403 



TTGGCTGAAGCCCATCAAAAACACAAG 

GTGAGTTCCAGGAGGCAGGTGCAGGTG 
CTGCTTCCTCCCGTATCACTTGTG 



Neo36 



GCTTTCGCTCCTGGTTCTTATGTTTG 

CAGCCGAACTGTTCGCCAGGCTCAAGG 
CGCATCGCCTTCTATCGCCTTCTTGAC 



Table 1. Mowgli Primer Sequences 

The position of the homology arms is chosen to functionally disrupt the Mowgli 
gene by deleting the seven transmembrane spaiming regions. A targeting vector is 
prepared where the deleted Mowgli sequence is replaced with non-homologous sequences 
composed of an endogenous gene expression reporter (a frame independent lacZ gene) 
upstream of a selection cassette composed of a promoted neomycin phosphotransferase 
(neo) gene arranged in the same orientation as the Mowgli gene. 

Once the 5' and 3' homology arms had been cloned into the targeting vector 
pTKSIBLMNL (see Figure 5), a large highly pure DNA preparation is made using 
standard molecular biology techniques. 20 \xg of the freshly prepared endotoxin free DNA 
is restricted with another rare-cutting restriction enzyme Pmel, present at a unique site in 
the vector backbone between the amplicillin resistance gene and the bacterial origin of 
replication. The linearized DNA is then precipitated and resuspended in 100 |il of 
Phosphate Buffered Saline, ready for electroporation. 

24 hours following electroporation the transfected cells are cultured for 9 days in 
medium containing 200 \ig/m\ neomycin. Clones are picked into 96 well plates, replicated 
and expanded before being screened by PCR (using primers 3'scr.3 and neo36, as 
described above) to identify clones in which homologous recombination had occurred 
between the endogenous MowgH gene and the targeting construct. Positive clones can be 
identified at a rate of 1 to 5%. These clones are expanded to allow replicas to be frozen 
and sufficient high quality DNA to be prepared for Southern blot confirmation of the 
targeting event using the external 5' and 3' probes prepared as described above, all using 
standard procedures (Russ et al, Nature 2000 Mar 2;404(6773):95-92000). 
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The structure of the genomic locus of mouse Mowgli before knock-out is depicted 
in Figure 3. The structure of the genomic locus of mouse Mowgli after knock-out is 
depicted in Figure 4. 

Generation of Mowgli GPCR Deficient Mice 

C57BL/6 female and male mice are mated and blastocysts are isolated at 3.5 days 
of gestation. 10-12 cells from a chosen clone are injected per blastocyst and 7-8 
blastocysts are implanted in the uterus of a pseudopregnant Fl female. A litter of chimeric 
pups are bom containing several high level (up to 100%) agouti males (the agouti coat 
colour indicates the contribution of cells descendent from the targeted clone). These male 
chimeras are mated with female and MFl and 129 mice, and germline transmission is 
determined by the agouti coat colour and by PGR genotyping respectively. 

PGR Genotyping is carried out on lysed tail clips, using the primers hetF and hetR 
with a third, vector specific primer (Asc403). This multiplex PGR allows amplification 
from the wild-type locus (if present) from primers hetF and hetR giving a 210 bp band. 
The site for hetF is deleted in the knockout mice, so this amplification will fail from a 
targeted allele. However, the Asc403 primer will amplify a 43 1 bp band from the targeted 
locus, in combination with the hetR primer which anneals to a region just inside the 3' 
arm. Therefore, this multiplex PGR reveals the genotype of the litters as follows: wild-type 
samples will exhibit a single 210 bp band; heterozygous DNA samples yield two bands at 
2 1 0 bp and 43 1 bp; and the homozygous samples will show only the target specific 43 1 bp 
band. 

LacZ Staining 

The Xgal staining of dissected tissues is performed in the following manner. 

Representative tissue slices are made of large organs. Whole small organs and 
tubes are sliced open, so fixative and stain will penetrate. Tissues are rinsed thoroughly in 
PBS (phosphate buffered saline) to remove blood or gut contents. Tissues are placed in 
fixative (PBS containing 2% formaldehyde, 0.2% glutaraldehyde, 0.02% NP40, ImM 
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MgC12, Sodium deoxycholate 0.23mM) for 30-45 minutes. Following three 5 minute 
washes in PBS, tissues are placed in Xgal staining solution (4mM K Ferrocyanide, 
4mMKFerricyanide, 2mM MgC12, Img/mlX-gal in PBS) for 18 hours at 30C. Tissues are 
PBS washed 3 times, postfixed for 24 hours in 4% formaldehyde, PBS washed again 
before storage in 70% ethanol. 

To identify Xgal stained tissues, dehydrated tissues are wax embedded, and 7um 
section sections cut, counterstained with 0.01% Safranin (9-10 min). 

Behavioural and Neurological Testing 

Mice are housed under a 12h lights- 12h dark light schedule (lights-on at 6 am) 
with free access to food and water. Mice (n=12), of mixed sexes, aged 3 to 4 months old, 
are submitted to behavioural testing during the morning, between lOh and 13h to avoid 
any differential circadian effect on the test results. 

Example 2. Expression of Recombinant MOWGLI Protein 

Recombinant MOWGLI is expressed and purified. Two systems are used for 
expression. 

pTOPO-Echo Donor Based Construct 

A polynucleotide having the sequence shown in SEQ ID NO: 2 (Below) is 
obtained from the human MOWGLI nucleic acid sequence (SEQ ID NO: 1). The SEQ ID 
NO: 2 polynucleotide is cloned into a pTOPO-Echo Donor vector module (Invitrogen 
pUniV5/His Cat# ETOOl-lO). Transfection of the resulting construct into a host strain and 
induction of expression (according to the manufacturer's instructions) yields a fusion 
protein having the sequence of SEQ ID NO: 6. 

The fusion polypeptide SEQ ID NO: 6 contains a C terminal V5 tag (residues 365 
to 378) and His tag (residues 379 to 384) to aid detection and .purification. 
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pCDNAS-JE Based Construct 

A polynucleotide having the sequence shown in SEQ ID NO: 7 is amplified by 
PGR using the oligonucleotide primers 

TAATATGCTAGCAGAATGGGGTTCAACTTGACGC and 
5 CGCATATATTATGATTACACTGATGTGTAGAAGCTTTTTATA to incorporate new 
restriction sites, Nhel and Hindlll at the 5-prime and 3-prime ends respectively of Mowgli. 
This is then digested and ligated into similarly digested pcDNAS-JE (Invitrogen Cat# - 
K60 1 0-0 1 vector modified to remove BGH Poly- A). 

The resulting construct is used for high level expression in CHO-Kl cells , and 
10 other mammalian cell lines, under the control of the cmv promoter to yield a native 
polypeptide. 

A polynucleotide having the sequence shown in SEQ ID NO: 7 is amplified by 
PGR using the oligonucleotide primers 

TAATATGCTAGCAGAATGGGGTTCAACTTGACGC and 
15 TATAAAAAGCTTCTACTTATCGTCGTCATCCTTGTAATCCACATCAGTGTAATC 
ATAATATATGCG to incorporate new restriction sites, Nhel and Hindlll at the 5-prime 
and 3-prime ends respectively of Mowgli and to include a 3' fusion FLAG tag. This is 
then digested and ligated into similarly digested pcDNA5-JE (Invitrogen Cat# - K60 10-01 
vector modified to remove BGH Poly-A). 

20 The resulting construct is used for high level expression in CHO-Kl cells, and 

other mammalian cell lines, under the control of the cmv promoter to yield a fusion 
polypeptide with C terminal FLAG tag (double underline) to aid detection and 
purification. The resultant expressed fusion polypeptide has a sequence shown in SEQ ID 
NO: 8. 

25 Introduction of Construct into Cells 

The expression vector is introduced to the cells by lipofection (using Fugene-6 
from Roche, Cat# 1 814 433) among other similar methods. 
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Both transient and stable transfection of these cells is achieved. In transient 
expression the cells are transfected by lipofection using a large amount of vector which 
results in a short-lived fast expression of the protein. In a stable transfection, the vector, 
which includes a selectable marker for neomycin resistance becomes stably integrated into 
5 the genome of the host cell resulting in a long lived cell line with a high expression level 
of Mowgli. 

Cells expressing recombinant MOWGLI are used for assay development, antibody 
production, and other purposes as described. 

Expression in Other Host Cells 

10 The recombinant/fusion clone containing SEQ ID NO: 2 is recombined into a 

pBAD-Thio-E fector (Invitrogen Cat# ET 100-01) for high level bacterial expression under 
control of the araBAD promoter, using a Cre/Lox mediated recombination system. 

The recombinant/fusion clone containing SEQ ID NO: 2 is recombined into a 
pBlueBac 4.5E (Invitrogen Cat# ET3 10-01), using a Cre/Lox mediated recombination 
15 system, for subsequent recombination into Baculovirus expression systems. 

Recombination into MaxBac (Invitrogen Cat# K875-02) for high level expression in SF9 
and other insect cell lines. 

The recombinant/fusion clone containing SEQ ID NO: 2 is recombined into 
pcDNA 3.1-E (Invitrogen Cat# ET400-01) , using a Cre/Lox mediated recombination 
20 system, for high level expression in CHO-Kl (Chinese Hamster Ovary) cells, and other 
mammalian cell lines, under the control of the cmv promoter. 

Example 3. Expression of MOWGLI in vivo 

LacZ staining of Mowgli heterozygous and mutant mice can demonstrate the 
expression of this in the testis, ovary, skin, eye, mammary tissue, lung and prostate gland. 
25 Such expression indicates a potential role for Mowgli in the development of therapeutics 
to treat abnormalities of these organs including cancers. 
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Furthermore, by comparing Mowgli with the role and localisation of purine 
receptors (e.g. P2Y12 which regulates mucin expression on the conjunctiva) Mowgli 
receptors may also be employed as potential targets for developing therapeutics relating to 
dry-eye disorders and cystic fibrosis. 

Example 4. Motor Control and Balance in MOWGLI Knock-Out Mouse 

MOWGLI mutant mice can be used to test the role of the protein in motor co- 
ordination and balance by their performance on the Rotarod apparatus. 

Motor co-ordination and balance are measured by performance on the rotarod. We 
use an accelerating Rotarod (Ugo Basile, Linton instruments, Jones and Roberts 1968. The 
quantiative measurement of motor inco-ordination in naive mice using an acelerating 
rotarod. J Pharm Pharmacol, 1968 Apr;20(4):302-4)). We also use the Rotarod to assess 
motor learning by repeating the task over several days. 

Mice are placed on the Rotarod, which accelerates at a constant rate from 4 to 40 
rpm in 5 in. As the Rotarod reaches higher speed, the mice often grip the Rotarod and 
hang on for a full rotation (i.e. passive rotation). The time at which the mouse makes one 
full rotation is recorded. Mice are left on the Rotarod after the first passive rotation and 
allowed to perform the rest of the test until they drop from the rod. Mice are given 3 trials 
on three consecutive days. 

Example 5. Visual Ability, Anxiety and Mobility in MOWGLI Knock-Out Mouse 

Tests for visual ability, anxiety and mobility in a MOWGLI knock-out mouse are 
conducted using a visual cliff test. 

The Visual cliff is developed by Fox (Fox M.W Anim Behav. 1965 Apr- 
Jul;13(2):232-3) and provides a measure of gross visual ability. It evaluates the ability of 
the mouse to see the drop-off at the edge of a horizontal surface. Time for the animal to 
move one placed on the cliff (latency) is a measure of anxiety. 
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A Perspex box is built with a horizontal plane connected to a vertical drop of 0.5m. 
A black and white chequerboard pattern accentuates the vertical drop-out. A sheet of clear 
Perspex is placed across the top horizontal cliff, extending over the top, thus there is the 
visual appearance of a cliff, but in fact the Perspex provides a solid horizontal surface. 

Each mouse is given 10 trials on the visual cliff. A trial consists of placing the 
mouse on the centre 'ridge' and noting the time taken from the animal to move off the 
'ridge' (latency) and recording the side on which the mouse stepped. The result is 
considered positive when the animal chooses to walk on the chequered "safe" side and 
avoids the cliff and negative result for the other way round. After 5 trials the box is turned 
through 180 degrees and a further 5 trials given. This is done to eliminate the variable of 
the position of the observer. The platform is cleaned between each animal. 

Example 6. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
MOWGLI Knock-Out Mouse: Paw Pressure Test 

MOWGLI mutants can be tested for their sensitivity to touch and pain stimuli 
using a range of tests. The tests in this and the following four Examples assess both 
neuropathic and inflammatory pain. 

Skin sensitivity is assessed by applying pressure on the hindpaw with a sharpened 
dowel rod whilst the animal is resting on a grid. Responses are graded as follow: 0 = no 
withdrawal; 1 =slow withdrawal of the paw; 2 = medium withdrawal of the paw; 3 = fast 
withdrawal of the paw. 

Example 7. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
MOWGLI Knock-Out Mouse: Tail Flick Test 

A tail flick analgesia test is performed using a Tail-Flick Analgesia Meter. This 
equipment provides an easy to use method to determine pain sensitivity accurately and 
reproducibly in rodents (D* Amour and Smith, 1941 Journal of Pharmacology and 
Experimental Therapeutics. Vol 41 p419-424).The instrument has a shutter-controlled 
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lamp as a heat source. The lamp is located below the animal to provide a less confining 
environment. Tail flick is detected by the automatic detection circuitry, which leaves the 
user's hands free to handle the animal. The animal is restrained in a ventilated tube and its 
tail placed on a sensing groove on top of the equipment. 

Activation of an intense light beam to the tail through opening of the shutter results 
in discomfort at some point when the animal will flick its tail out of the beam. In the 
automatic mode a photo-detector detects the tail motion causing the clock to stop and the 
shutter to close. The total time elapsed between the shutter opening and the animal's 
reaction is recorded. 

Responses of mutant transgenic mice are compared with age and sex matched 
wild- type mice. A single animal may be subjected to different heat settings to produce an 
increase in tail temperature no greater than 55^C. 

Using the tail flick test MOWGLI mutants can be tested for their sensitivity to heat 
induced pain and compared to their wild-type counterparts. Decreased sensitivity in 
mutants is associated by the withdrawal of their tails after a longer time period of exposure 
to the heat source. 

Example 8. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
MOWGLI Knock-Out Mouse: Formalin Test 

The formalin test measures the response to a noxious substances injected into a 
hind paw. A volume of 20)al of a 1% formalin solution is injected through a fine gauge 
needle subcutaneously into the dorsal surface of one hindpaw. Licking and biting the 
hindpaw is quantitated as cumulative number of seconds engaged in the behaviours. A 
rating scale is used: 1= the formalin injected paw rests lightly on the floor bearing less 
weight; 2= the injected paw is elevated; 3= the injected paw is licked, bitten or shaken. 

Two phases of responses are seen in the formalin test. Phase 1 begins immediately 
after injection and lasts about 10 mins, representing the acute burst of activity from pain 
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fibres. Phase two begins about 20 mins after injection and continues for about one hour. 
This phase appears to represent responses to tissue damage, including inflammatory 
hyperalgesia. 

Using the formalin test MOWGLI mutants may be shown to be less sensitive to 
inflammatory, pain and may show a reduced severity of response in the formalin test when 
compared to wild type animals. 

Example 9. Tests for Neuropathic Pain Testing in MOWGLI Knock-Out Mouse 

The most straigthforward test for neuropathic pain is the chronic constriction of the 
sciatic nerve. In this model, peripheral nerve injury is induced by cuffing the sciatic nerve 
(Cheng HY et al. DREAM is a critical transcriptional repressor for pain modulation. Cell, 
2002, 108, 31-43). Paw withdrawal thresholds are measured using Von-Frey hairs once 
every 2 days for 10 days following cuff implantation. Spontaneous pain behaviour is also 
apparent, separately from Von-Frey testing, and is caracterised by rapid shaking and 
lifting of the paw. 

In the neuropathic pain test, mutants MOWGLI mice can be shown to have an 
increase in paw withdrawal tlireshold and reduced spontaneous pain behaviour as 
compared to wildtype mice. 

Example 10. Tests for the Role of MOWGLI in Energy Metabolism 

MOWGLI mutant mice may be used to test the fimction of the MOWGLI protein 
in energy metabolism by the determination of total body oxygen consumption (V02) (a 
parameter of energy expenditure) through indirect calorimetry. Oxygen is required for 
combustion of fiaels. By measuring total body oxygen consumption the total amount of 
fuel used can be calculated. Energy expenditure is measured by estimating oxidation rates • 
of macronutrients from rates of respiratory exchange of oxygen and carbon dioxide 
(Vidal-Puig AJ, Grujic C, Zhang CY, Hagen T, Boss O, Ido Y, Szczepanik A, Wade J, 
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Mootha V, Cortright R: Energy Metabolism in Uncoupling Protein 3 Gene Knockout Mice 
. J Biol Chem 275:16258-16266, 2000). 

These measurements provide information not only about the total energy 
expenditure but also about preferential use of specific nutrients as a source of energy. 
Energy expenditure is measured in basal conditions and in response to some established 
modulators (e.g. fasting would produce a decrease in energy expenditure), specific diet 
manipulation (e.g. High fat diet should increase energy expenditure), or pharmacological 
treatment (e.g. adrenergic receptor agonists should induce energy expenditure (8)). This 
information determines whether energy expenditure is altered in MOWGLI inutants and 
provides insights on the mechanisms involved (e.g. oxidation of fatty acids 
carbohydrates etc). These measurements are performed in calorimeters with individual 
chambers. 

Thus oxygen consumption and respiratory exchange ratio are measured (AA) in 
basal, fasted (24/36h), and other experimental conditions (e.g. diets, pharmacological 
treatments) using a calorimeter (e.g. Columbus Instruments, Columbus, OH). Standard 
conditions of the measurements are settling time 100 sec, measuring time 50 sec, with 
room air as reference. However, these parameters are optimised specifically for every 
experiment. Air in the respiratory chamber is continuously exchanged. This chamber has 
two sensors that measure the difference between the oxygen and CO2 that enters and 
leaves the chamber. Experiments may require up to 1-hour calibration time, period during 
which animals can be placed in the chamber. Thus a single animal is deposited in an 
individual chamber and left undisturbed during the time required for the experiment. The 
length of the experiment would be variable (2-9 h) depending on its specific nature (e.g. 
fed or fasted animals will be in the chamber during 2h while Lipopolisacaride injected 
animals will require 6-8 hours). 

Differences in reading between MOWGLI mutant and wild type mice indicate a 
role for the MOWGLI protein in metabolism and indicate the use of this target for the 
development of therapeutics tackling disorders of hypercholesterolaemia, dislipdaemias 
and obesity. 
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Each of the appHcations and patents mentioned in this document, and each 
document cited or referenced in each of the above appHcations and patents, including 
during the prosecution of each of the appHcations and patents ("application cited 
documents") and any manufacturer's instructions or catalogues for any products cited or 
mentioned in each of the applications and patents and in any of the application cited 
documents, are hereby incorporated herein by reference. Furthermore, all documents cited 
in this text, and all documents cited or referenced in documents cited in this text, and any 
manufacturer's instructions or catalogues for any products cited or mentioned in this text, 
are hereby incorporated herein by reference. 

Various modifications and variations of the described methods and system of the 
invention wiU be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the claims. 
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Claims 

1. A Movvgli GPCR polypeptide comprising the amino acid sequence shown in SEQ 
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant or 
derivative thereof. 

2. A nucleic acid encoding a polypeptide according to Claim 1 . 

3. A nucleic acid according to Claim 2, comprising the nucleic acid sequence shown 
in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a 
homologue, variant or derivative thereof. 

4. A polypeptide comprising a fragment of a polypeptide according to Claim 1 . 

5. A polypeptide according to Claim 3 which comprises one or more regions which 
are homologous between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 
6 and SEQ ID NO: 8, or which comprises one or more regions which are heterologous 
between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID 
NO: 8. 

6. A nucleic acid encoding a polypeptide according to Claim 4 or 5. 

7. A vector comprising a nucleic acid according to Claim 2, 3, or 6. 

8. A host cell comprising a nucleic acid according to Claim 2, 3, or 6, or vector 
according to Claim 7. 

9. A transgenic non-humaii animal comprising a nucleic acid according to Claim 2, 3 
or 6, or a vector according to Claim 7. 
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10. A transgenic non-human animal according to Claim 9 which is a mouse. 

1 1 . Use of a polypeptide according to Claim 1, 4 or 5 in a method of identifying a 
compound which is capable of interacting specifically with a G protein coupled receptor. 

12. Use of a transgenic non-human animal according to Claim 9 or 10 in a method of 

identifying a compound which is capable of interacting specifically with a G protein 
coupled receptor. 

13. A method for identifying an antagonist of a Mowgli GPCR, the method comprising 
contacting a cell which expresses Mowgli receptor with a candidate compound and 
determining whether the level of cyclic AMP (cAMP) in the cell is lowered as a result of 
said contacting. 

14. A method for identifying a compound capable of lowering the endogenous level of 
cyclic AMP in a cell which method comprises contacting a cell which expresses a Mowgli 
GPCR with a candidate compound and determining whether the level of cyclic AMP 
(cAMP) in the cell is lowered as a result of said contacting. 

15. . A method of identifying a compound capable of binding to a Mowgli GPCR 
polypeptide, the method comprising contacting a Mowgli GPCR polypeptide with a 
candidate compound and determining whether the candidate compound binds to the 
Mowgli GPCR polypeptide. 

16. A compound identified by a method according to any of Claims 11 to 15. 

17. A compound capable of binding specifically to a polypeptide according to Claim 1, 
4 or 5. 

1 8. Use of a polypeptide according to Claim 1 , 4 or 5, or part thereof or a nucleic acid 
according to Claim 2, 3 or 6, in a method for producing antibodies. 
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19. An antibody capable of binding specifically to a polypeptide according to Claim 1, 
4 or 5, or part thereof or a polypeptide encoded by a nucleotide according to Claim 2, 3 or 
6, or part thereof. 

20. A pharmaceutical composition comprising any one or more of the following: a 
polypeptide according to Claim 1, 4 or 5, or part thereof; a nucleic acid according to Claim 
2, 3 or 6, or part thereof; a vector according to Claim 7; a cell according to Claim 8; a 
compound according to Claim 16 or 17; and an antibody according to Claim 19, together 

. with a pharmaceutically acceptable carrier or diluent. 

21. A vaccine composition comprising any one or more of the following: a polypeptide 
according to Claim 1, 4 or 5, or part thereof; a nucleic acid according to Claim 2, 3 or 6, or 
part thereof; a vector according to Claim 7; a cell according to Claim 8; a compound 
according to Claim 16 or 17; and an antibody according to Claim 19. 

22. A diagnostic kit for a disease or susceptibility to a disease comprising any one or 
more of the following: a polypeptide according to Claim 1, 4 or 5, or part thereof; a 
nucleic acid according to Claim 2, 3 or 6, or part thereof; a vector according to Claim 7; a 
cell according to Claim 8; a compound according to Claim 16 or 17; and an antibody 
according to Claim 19. 

23. A method of treating a patient suffering from a disease associated with enhanced 
activity of a Mowgli GPCR, which method comprises administering to the patient an 
antagonist of Mowgli GPCR. 

24. A method of treating a patient suffering from a disease associated with reduced 
activity of a Mowgli GPCR, which method comprises administering to the patient an 
agonist of Mowgli GPCR. 
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25. A method according to Claim 23 or 24, in which the MowgH GPCR comprises a 
polypeptide having the sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 
or SEQ ID NO: 8. 

26. A method for treating and/or preventing a disease in a patient, which comprises the 
step of administering any one or more of the following to the patient: a polypeptide 
according to Claim 1, 4 or 5, or part thereof; a nucleic acid according to Claim 2, 3 or 6, or 
part thereof; a vector according to Claim 7; a cell according to Claim 8; a compound 
according to Claim 16 or 17; an antibody according to Claim 19; a pharmaceutical 
composition according to Claim 20; and a vaccine according to Claim 20. 

27. An agent comprising a polypeptide according to Claim 1, 4 or 5, or part thereof; a 
nucleic acid according to Claim 2, 3 or 6, or part thereof; a vector according to Claim 7; a 
cell according to Claim 8; a compound according to Claim 16 or 17; and/or an antibody 
according to Claim 19, said agent for use in a method of treatment or prophylaxis of 
disease. 

28. Use of a polypeptide according to Claim 1, 4 or 5, or part thereof; a nucleic acid 
according to Claim 2, 3 or 6, or part thereof; a vector according to Claim 7; a cell 
according to Claim 8; a compound according to Claim 16 or 17; and an antibody 
according to Claim 19, for the preparation of a pharmaceutical composition for the 
treatment or prophylaxis of a disease. 

29. A non-human transgenic animal, characterised in that the transgenic animal 
comprises an altered Mowgli gene. 

30. A non-human transgenic animal according to Claim 29, in which the alteration is 
selected from the group consisting of: a deletion of Mowgli, a mutation in Mowgli 
-resulting in loss of function, introduction of an exogenous gene having a nucleotide 
sequence with targeted or random mutations into Mowgli, introduction of an exogenous 
gene from another species into Mowgli, and a combination of any of these. 
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31. A non-human transgenic animal having a functionally disrupted endogenous 
Mowgli gene, in which the transgenic animal comprises in its genome and expresses a 
transgene encoding a heterologous Mowgli protein. 

32. A nucleic acid construct for functionally disrupting a Mowgli gene in a host cell, 
the nucleic acid construct comprising: (a) a non-homologous replacement portion; (b) a 
first homology region located upstream of the non-homologous replacement portion, the 
first homology region having a nucleotide sequence with substantial identity to a first 
Mowgli gene sequence; and (c) a second homology region located downstream of the non- 
homologous replacement portion, the second homology region having a nucleotide 
sequence with substantial identity to a second Mowgli gene sequence, the second Mowgli 
gene sequence having a location downstream of the first Mowgli gene sequence in a 
naturally occurring endogenous Mowgli gene. 

33. A process for producing a Mowgli GPCR polypeptide, the method comprising 
culturing a host cell according to Claim 8 under conditions in which a nucleic acid 
encoding a Mowgli GPCR polypeptide is expressed. 

34. A method of detecting the presence of a nucleic acid according to Claim 2, 3 or 6 
in a sample, the method comprising contacting the sample with at least one nucleic acid 
probe which is specific for said nucleic acid and monitoring said sample for the presence 
of the nucleic acid, 

35. A method of detecting the presence of a polypeptide according to Claim 1, 4 or 5 
in a sample, the method comprising contacting the sample with an antibody according to 
Claim 19 and monitoring said sample for the presence of the polypeptide. 

36. A method of diagnosis of a disease or syndrome caused by or associated with 
increased, decreased or otherwise abnormal expression of Mowgli GPCR, the method 
comprising the steps of: (a) detecting the level or pattern of expression of Mowgli GPCR 
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in an animal suffering or suspected to be suffering from such a disease; and (b) comparing 
tlie level or pattern of expression with that of a normal animal. 

37. A diagnostic kit according to Claim 22, a method according to Claim 23, 24, 26 or 
36, an agent according to Claim 27 or a use according to Claim 28, in which the disease is 
selected from the group consisting of: platelet ADP receptor defect; autosomal recessive 
deafness; myotonic dystrophy; essential hypertension; Usher syndrome, type 3; juvenile- 
onset cataract; congenital cataract; and 3-methylcrotonylglycinuria I. 

38 A nucleic acid sequence selected from group consisting of musMowgli 5'prF 
(ACTCCATCTGGTAGGTAGGGCAGTGAC), musMowgli 5'prR 
(ATACCATGTTGCCTGGTCCAGATAGAC), musMowgli 5'armF.2 Not 
(AAAGCGGCCGCGTATGTGGCCATCAAGAAACGTGAAC), musMowgli 5'armR.2 
Spe (TTTACTAGTTCTTCCCATGTCCCTCACTTGTGCTG), musMowgli 3'armF Asc 
(aaaggcgcgccAACACCTGCACCTGCCTCCTGGAACTC), musMowgli 3'armR.3 Fse 
(aaaggccggcCACAGCACACTGATCACAGCATCATC), musMowgli 3'scr.3 
(GATAGCACAAGTGCCACCCTGTGATAG), musMowgli 3'prF.3 
(ATTAGGGATGGATACCTGCCAAATGTG), musMowgli 3'prR.3 
(GTCTTCTCTAGGCACTGTGTTTGAAGC), musMowgli hetF 
(TTGGCTGAAGCCCATCAAAAACACAAG), musMowgli hetR 
(GTGAGTTCCAGGAGGCAGGTGCAGGTG), musMowgli 792F 
(CTGCTTCCTCCCGTATCACTTGTG), musMowgli 1014R 
(GCTTTCGCTCCTGGTTCTTATGTTTG), Asc403 
(CAGCCGAACTGTTCGCCAGGCTCAAGG), and Neo36 
(CGCATCGCCTTCTATCGCCTTCTTGAC). 

39. A nucleic acid sequence, including a Mowgli nucleic acid sequence, a polypeptide 
sequence, including a Mowgli polypeptide sequence, a method of treatment, a method of 
diagnosis, a host cell, vector, transgenic animal, assay, diagnostic kit, vaccine, 
pharmaceutical composition or agent substantially as hereinbefore described with 
reference to and as shown in the accompanying drawings. 
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ABSTRACT 
RECEPTOR 

We disclose Mowgli G-protein coupled receptor (GPCR) polypeptides comprising 
the amino acid sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ 
ID NO: 8, and homologues, variants and derivatives thereof. Nucleic acids capable of 
encoding Mowgli polypeptide are also disclosed, in particular, those comprising the 
nucleic acid sequences shown in SEQ ID NO.: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 7 or SEQ ID NO: 10. 

Figure 2 



SEQUENCE LISTING 



107 



<110> Paradigm Therapeutics Limited 
<120> Receptor 
<130> P11369GB 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



<160> 5 

<170> SeqWin99, version 1.02 

<210> 1 

<211> 1478 

<212> DNA 

<213> Homo sapiens 



<400> 1 

cctggcagga 

acctaccttg 

cctgccgacc 

gaagacaccg 

aataacctat 

get t acagtg 

taacgagctg 

gaacaccacc 

atttgtggca 

taaaaccagc 

gacatttcca 

tctctgcaga 

tgggctgata 

gtacagcata 

gtctttgcca 

ctcaaaactt 

ctgcttgttt 

catccacaaa 

gagcatcagg 

aattcctttt 

atattactgc 

aatttacttt 

aaccaggagt 

ttatgattac 

taaataaatg 



gaatttgaaa 
tctggtaggg 
ttagtttcaa 
tatgaggtga 
gctgaaccca 
catcacaact 
cacggccaag 
cttcacaatg 
agcatcttgc 
ttcatattct 
tttcgaatag 
tacacttcag 
agcattgatc 
accttcacga 
aacatcatcc 
aaaagtcctt 
gtggccgtgc 
tccagcaggc 
gttgttgtgg 
acttttagtc 
aaagaaatta 
ttcatgtgta 
gaaagcatca 
actgatgtgt 
tttcttttca 



gggtgcccca 
gagatgtttc 
agcttattct 
atggacagcc 
cgcctcaatc 
gaagaatggg 
agagtcacaa 
aatttgacac 
tgaatggttt 
atctcaaaaa 
tccatgatgc 
ttttgtttta 
gctatctgaa 
aggttttatc 
tgacaaatgg 
tgggggtcaa 
tggtgattct 
aattcataag 
ctgtgttttt 
acttagacag 
cacttttctt 
ggtcattttc 
gatcactgca 
aggcctttta 
ttatccttaa 



aaggacaatc 
gttttcatgc 
taattagaga 
agccaccaca 
gtccccaagt 
gtt caacttg 
ttcaggcaac 
aattgtcttg 
agcagtgtgg 
catagtggtt 
aggatttgga 
tgcaaacatg 
ggtggtcaag 
tgtttgtgtt 
tcagccaaca 
atggcatacg 
gatcggatgt 
tcagtcaagc 
tacctgcttt 
gcttttagat 
gtctgcgtgt 
aagaaggctg 
aagtgtgaga 
ttgtttgttg 
aaaaaaaa 



tctaaagggg 
tttaccagaa 
caagaaacct 
atgaaagaaa 
gtttcctgac 
acgcttgcaa 
aggagcgacg 
ccggtgcttt 
atcttcttcc 
gcagacctca 
ccttggtact 
tatacttcca 
ccatttgggg 
tgggtgatca 
gaggacaata 
gcagjtcacct 
tacatagcca 
cgaaagcgaa 
ctaccatatc 
gaatctgcac 
aatgtttgcc 
ttcaaaaaat 
agatcggaag 
gaatcgatat 



t aagggagat 
aatccacttc 
gtttcaactt 
tcaaaccagg 
acgcatcttt 
aattaccaaa 
ggccaggaaa 
atctcattat 
acattaggaa 
taatgacgct 
tcaagtttat 
tcgtgttcct 
actctcggat 
tggctgtttt 
tccatgactg 
atgtgaacag 
tatccaggta 
aacataacca 
acttgtgcag 
aaaaaatcct 
tggatccaat 
caaatatcag 
t tcgcatat a 
gtacaaagtg 
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<211> 1077 

<212> DNA 

<213> Homo sapiens 
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atggggt tea 

cacaattcag 

gacacaattg 

ggtttagcag 

aaaaacatag 

gatgcaggat 

ttttatgcaa 

ctgaaggtgg 

ttatctgttt 



acttgaegct 
gcaacaggag 
tcttgccggt 
tgtggatctt 
tggttgeaga 
ttggacettg 
acatgtatac 
tcaagccatt 
gtgtttgggt 



tgcaaaatta 
cgacgggcca 
gctttatctc 
cttccacatt 
cctcataatg 
gtacttcaag 
ttccatcgtg 
tggggactct 
gatcatggct 



ccaaataacg 
ggaaagaaca 
attatatttg 
aggaataaaa 
acgctgacat 
tttattctct 
ttcettgggc 
cggatgtaca 
gttttgtctt 



agctgcaegg 
ccacccttea 
tggcaagcat 
ccagcttcat 
ttccatttcg 
geagatacac 
tgataagcat 
gcataacctt 
tgccaaacat 



ccaagagagt 
caatgaattt 
cttgctgaat 
attctatctc 
aatagtccat 
ttcagttttg 
tgatcgctat 
cacgaaggt t 
catcctgaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1478 
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120 

180 
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540 
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aatggtcagc caacagagga caatatccat gactgctcaa aacttaaaag tcctttgggg 

600 

gtcaaatggc atacggcagt cacctatgtg aacagctgct tgtttgtggc cgtgctggtg 660 

attctgatcg gatgttacat agccatatcc aggtacatcc acaaatccag caggcaattc 720 

5 ataagtcagt caagccgaaa gcgaaaacat aaccagagca tcagggttgt tgtggctgtg 780 

ttttttacct gctttctacc atatcacttg tgcagaattc cttttacttt tagtcactta 840 

gacaggcttt tagatgaatc tgcacaaaaa atcctatatt actgcaaaga aattacactt 900 

ttcttgtctg cgtgtaatgt ttgcctggat ccaataattt actttttcat gtgtaggtca 960 

ttttcaagaa ggctgttcaa aaaatcaaat atcagaacca ggagtgaaag catcagatca 1020 

10 ctgcaaagtg tgagaagatc ggaagttcgc atatattatg attacactga tgtgtag 1077 

<210> 3 

<211> 358 

<212> PRT 

15 <213> Homo sapiens 

<400> 3 

Met Gly Phe Asn Leu Thr Leu Ala Lys Leu Pro Asn Asn Glu Leu His 
15 10 15 



20 



35 



50 



Gly Gin Glu Ser His Asn Ser Gly Asn Arg Ser Asp Gly Pro Gly Lys 
20 25 30 



Asn Thr Thr Leu His Asn Glu Phe Asp Thr lie Val Leu Pro Val Leu 

25 35 40 45 

Tyr Leu lie He Phe Val Ala Ser lie Leu Leu Asn Gly Leu Ala Val 
50 • 55 60 

30 Trp lie Phe Phe His lie Arg Asn Lys Thr Ser Phe He Phe Tyr Leu 

65 70 75 80 



Lys Asn He Val Val Ala Asp Leu lie Met Thr Leu Thr Phe Pro Phe 

85 90 95 

Arg lie Val His Asp Ala Gly Phe Gly Pro Trp Tyr Phe Lys Phe He 

100 105 110 



Leu Cys Arg Tyr Thr Ser Val Leu Phe Tyr Ala Asn Met Tyr Thr Ser 
40 115 120 125 

He Val Phe Leu Gly Leu He Ser He Asp Arg Tyr Leu Lys Val Val . 
130 135 140 

45 Lys Pro Phe Gly Asp Ser Arg Met Tyr Ser He Thr Phe Thr Lys Val 

145 150 155 160 



Leu Ser Val Cys Val Trp Val He Met Ala Val Leu Ser Leu Pro Asn 
165 170 175 

He He Leu Thr Asn Gly Gin Pro Thr Glu Asp Asn He His Asp Cys 
180 185 190 



Ser Lys Leu Lys Ser Pro Leu Gly Val Lys Trp His Thr Ala Val Thr 
55 195 200 205 

Tyr Val Asn Ser Cys Leu Phe Val Ala Val Leu Val He Leu He Gly 
210 215 220 

60 Cys Tyr He Ala He Ser Arg Tyr He His Lys Ser Ser Arg Gin Phe 



225 



230 



109 

235 



240 



He Ser Gin Ser Ser Arg Lys Arg Lys His Asn Gin Ser He Arg Val 
245 250 255 

5 

Val Val Ala Val Phe Phe Thr Cys Phe Leu Pro Tyr His Leu Cys Arg 
260 265 270 

He Pro Phe Thr Phe Ser His Leu Asp Arg Leu Leu Asp Glu Ser Ala 
10 275 280 285 

Gin Lys He Leu Tyr Tyr Cys Lys Glu He Thr Leu Phe Leu Ser Ala 
290 295 300 

15 Cys Asn Val Cys Leu Asp Pro He He Tyr Phe Phe Met Cys Arg Ser 
305 310 315 320 



20 



Phe Ser Arg Arg Leu Phe Lys Lys Ser Asn He Arg Thr Arg Ser Glu 
325 330 335 

Ser He Arg Ser Leu Gin Ser Val Arg Arg Ser Glu Val Arg He Tyr 
340 345 350 



Tyr Asp Tyr Thr Asp Val 
25 355 

<210> 4 

<211> 1160 

<212> DNA 

30 <213> Mus musculus 

<400> 4 

atggggctca acttgacgct tacaaaatta ccaggtaatg agctgtacag ccaagcaagt 60 

cacactgcaa acagcacaag tgagggacat gggaagaact ctaccctgca caacaaattt 120 

35 gacaccatca tcctgccagt gctttaccta gttatctttg tggcaagcat cctgctgaac 180 

ggtctggccg tgtggatctt cttccacatt cggaataaaa ccagcttcat attttatctc 240 

aaaaatatag tggtagctga cctcatcatg accctgacat tcccattccg aatagtccgt 300 

gatgcaggat tcggaccttg gtacttcgag tttatcctct gcagatacac ctcagttttg 360 

ttctatgcaa acatgtatac atctatcgtg tttcttgggc tgatcagtgt tgatcggtat 420 

40 ctaaaggtgg taaagccctt tggtgactct cgcatgtaca gcataacctt caccaaagtt 480 

ttatcagttt gtgtttgggt gatcatggct attctgtcct tgccaaacat catactgact 540 

aatgggcagc caactaagga aaatattcat gactgcatga aactcaaaag tccgctggga 600 

gccaagtggc atatggctgt cacctatgtg gacagctgtt tgtttgtggc cgtgctggtg 660 

atcctgattg gatgctacat agccatctcc agatacatcc acaaatccag caggcaattc 720 

45 ataagccaat cgagccggaa gcgaaagcac aaccagagca tccgcgtggt cgtggctgtg 780 

ttttttacct gcttcctccc gtatcacttg tgcagaatcc cctttacctt cagtaactta 840 

gacaggcttc tggatgaatc agcacataaa atcctctact attgcaaaga aatgacactt 900 

ttcttgtctg cgtgcaacgt gtgcctggat ccgataattt attttttcat gtgtaagtca 960 

ttttcaagaa ggttattcaa gaaatcaaac ataagaacca ggagcgaaag catcaggtcg 1020 

50 ctgcaaagcg tccgaagatc agaagtgcgc atctattatg actacactga tgtgtaagga 1080 

ctagaggcca gggacccttt tctattggaa tcagtatgta aaagagtgta aataaatgtt 1140 

tctttttatt atcttggctg 1160 

<210> 5 

55 <211> 358 

<212> PRT 

<213> Mus musculus 

<400> 5 

60 Met Gly Leu Asn Leu Thr Leu Thr Lys Leu Pro Gly Asn Glu Leu Tyr 



1 

Ser Gin Ala Ser 
20 

Asn Ser Thr Leu 
35 

Tyr Leu Val lie 
50 

Trp lie Phe Phe 
65 

Lys Asn lie Val 



Arg lie Val Arg 
100 

Leu Cys Arg Tyr 
115 

He Val Phe Leu 
130 

Lys Pro Phe Gly 
145 

Leu Ser Val Cys 



He He Leu Thr 
180 

Met Lys Leu Lys 
195 

Tyr Val Asp Ser 
210 

Cys Tyr He Ala 

225 

He Ser Gin Ser 



Val Val Ala Val 
260 

He Pro Phe Thr 
275 

His Lys He Leu 
290 

Cys Asn Val Cys 
305 

Phe Ser Arg Arg 



5 

His Thr Ala Asn 



His Asn Lys Phe 
40 

Phe Val Ala Ser 
55 

His He Arg Asn 
70 

Val Ala Asp Leu 
85 

Asp Ala Gly Phe 



Thr Ser Val Leu 
120 

Gly Leu He Ser 
135 

Asp Ser Arg Met 

150 

Val Trp Val He 
165 

Asn Gly Gin Pro 



Ser Pro Leu Gly 
200 

Cys Leu Phe Val 
215 

He Ser Arg Tyr 

230 

Ser Arg Lys Arg 
245 

Phe Phe Thr Cys 



Phe Ser Asn Leu 
280 

Tyr Tyr Cys Lys 
295 

Leu Asp Pro He 
310 

Leu Phe Lys Lys 



110 

10 

Ser Thr Ser Glu 
25 

Asp Thr He He 



He Leu Leu Asn 
60 

Lys Thr Ser Phe 
75 

He Met Thr Leu 
90 

Gly Pro Trp Tyr 
105 

Phe Tyr Ala Asn 



Val Asp Arg Tyr 
140 

Tyr Ser He Thr 
155 

Met Ala He Leu 
170 

Thr Lys Glu Asn 
185 

Ala Lys Trp His 



Ala Val Leu Val 
220 

He His Lys Ser 
235 

Lys His Asn Gin 
250 

Phe Leu Pro Tyr 
265 

Asp Arg Leu Leu 



Glu Met Thr Leu 
300 

He Tyr Phe Phe 
315 

Ser Asn He Arg 



15 

Gly His Gly Lys 
30 

Leu Pro Val Leu 
45 

Gly Leu Ala Val 



He Phe Tyr Leu 
80 

Thr Phe Pro Phe 
95 

Phe Glu Phe He 
110 

Met Tyr Thr Ser 
125 

Leu Lys Val Val 



Phe Thr Lys Val 
160 

Ser Leu Pro Asn 
175 

He His Asp Cys 
190 

Met Ala Val Thr 
205 

He Leu He Gly 



Ser Arg Gin Phe 
240 

Ser He Arg Val 
255 

His Leu Cys Arg 
270 

Asp Glu Ser Ala 
285 

Phe Leu Ser Ala 



Met Cys Lys Ser 
320 

Thr Arg Ser Glu 



Ill 

325 330 335 

Ser He Arg Ser Leu Gin Ser Val Arg Arg Ser Glu Val Arg He Tyr 

340 345 350 

Tyr Asp Tyr Thr Asp Val 
355 



SEQ ID NO: 6 Fusion protein of human Mowgli ( seq ID No 3) with V5 and His tags 
10 atc-terminal end. 

MGFNLTLAKLPNNELHGQESHNSGNRSDGPGKNTTLHNEFDTIVLPVLYLIIFVASI 
LLNGLAVWIFFHIRNKTSFIFYLKNIVVADLIMTLTFPFRIVHDAGFGPWYFKFILC 
RYTSVLFYANMYTSIVFLGLISIDRYLKVVKPFGDSRMYSITFTKVLSVCVWVIMA 
VLSLPNIILTNGQPTEDNIHDCSKLKSPLGVKWHTAVTYVNSCLFVAVLVILIGCYI 
15 AISRYIHKSSRQFISQSSRKRKHNQSIRVVVAVFFTCFLPYHLCRIPFTFSHLDRLLD 
ESAQKILYYCKEITLFLSACNVCLDPIIYFFMCRSFSRRLFKKSNIRTRSESIRSLQSV 
RRSEVRIYYDYTDVRAIRELGKPIPNPLLGLDSSHHHHHH 



SEQ ID NO: 7 

20 TAATATGCTAGCAGAATGGGGTTCAACTTGACGCTTGCAAAATTACCAAATAA 
CGAGCTGCACGGCCAAGAGAGTCACAATTCAGGCAACAGGAGCGACGGGCCA 
GGAAAGAACACCACCCTTCACAATGAATTTGACACAATTGTCTTGCCGGTGCT 
TTATCTCATTATATTTGTGGCAAGCATCTTGCTGAATGGTTTAGCAGTGTGGAT 
CTTCTTCCACATTAGGAATAAAACCAGCTTCATATTCTATCTCAAAAACATAGT 

25 GGTTGCAGACCTCATAATGACGCTGACATTTCCATTTCGAATAGTCCATGATG 
CAGGATTTGGACCTTGGTACTTCAAGTTTATTCTCTGCAGATACACTTCAGTTT 
TGTTTTATGCAAACATGTATACTTCCATCGTGTTCCTTGGGCTGATAAGCATTG 
ATCGCTATCTGAAGGTGGTCAAGCCATTTGGGGACTCTCGGATGTACAGCATA 
ACCTTCACGAAGGTTTTATCTGTTTGTGTTTGGGTGATCATGGCTGTTTTGTCTT 

30 TGCCAAACATCATCCTGACAAATGGTCAGCCAACAGAGGACAATATCCATGA 
CTGCTCAAAACTTAAAAGTCCTTTGGGGGTCAAATGGCATACGGCAGTCACCT 
ATGTGAACAGCTGCTTGTTTGTGGCCGTGCTGGTGATTCTGATCGGATGTTACA 
TAGCCATATCCAGGTACATCCACAAATCCAGCAGGCAATTCATAAGTCAGTCA 

35 TACCTGCTTTCTACCATATCACTTGTGCAGAATTCCTTTTACTTTTAGTCACTTA 
GACAGGCTTTTAGATGAATCTGCACAAAAAATCCTATATTACTGCAAAGAAAT 
TACACTTTTCTTGTCTGCGTGTAATGTTTGCCTGGATCCAATAATTTACTTTTTC 
ATGTGTAGGTCATTTTCAAGAAGGCTGTTCAAAAAATCAAATATCAGAACCAG 
GAGTGAAAGCATCAGATCACTGCAAAGTGTGAGAAGATCGGAAGTTCGCATA 

40 TATTATGATTACACTGATGTGTAGAAGCTTTTTATA 



SEQ ID NO: 8 

MGFNLTLAKLPNNELHGQESHNSGNRSDGPGKNTTLHNEFDTIVLPVLYLIIFVASI 
LLNGLAVWIFFHIRNKTSFIFYLKNIVVADLIMTLTFPFRIVHDAGFGPWYFKFILC 



10 



20 
25 
30 



40 



50 
55 



65 
70 
75 
80 
85 
90 
95 



112 

RYTSVLFYANMYTSIVFLGLISIDRYLKVVKPFGDSRMYSITFTKVLSVCVWVIMA 

VLSLPNIILTNGQPTEDNIHDCSKLKSPLGVKWHTAVTYVNSCLFVAVLVILIGCYI 
AISRYIHKSSRQFISQSSRKRKHNQSIRVVVAVFFTCFLPYHLCRIPFTFSHLDRLLD 
ESAQKILYYCKEITLFLSACNVCLDPIIYFFMCRSFSRRLFKKSNIRTRSESIRSLQSV 
RRSEVRIYYDYTDVDYKDDDDK 



SEQ ID ND:10: Mouse Genomic Locus from 5'prF to 3'prR.3 



Sequence Range: 46601 to 54800 

>5'prF 



I 46700 
, ^ AAAAGGATCCTACAAGGGCCATTGTACCATGGATTTGACACCCTATAATTTATACAACTTTATTAI^^ 
1 J TTTTCCTAGGATGTTCCCGGTAACATGGTACCTAAACTGTGGGATATTAAATATGTTC 

46800 

GTCGAGGCTTTTAATATTCTGTGTCACAGGGACACGCCCCAATGAACAATAGAGAGATTCACCCGAGGCAAAGGTGCTAAGATTCTGCATCTAAGCCTT 
CAGCTCCGAAAATTATAAGACACAGTGTCCCTGTGCGGGGTTACTTGTTATCTCTCTAAGTGGGCTCCGTTTCCACGATTCTAAGACGT 

46900 

ATCACTCCCACTCCAACTGCATACAAAAGAGACTCTAAGAGTAAATGGACACAITACACTTAGGTCACTGTGACGAAAAGTCTATC^ 
TAGTGAGGGTGAGGTTGACGTATGTTTTCTCTGAGATTCTCATTTACCTGTGTAATGTGAATCCAGTGACACTC 

<S 'prR 

>5 ' arml 

! I 

>S 'armF. 2 



I I 



47000 

TGGTATCTGGCCATCAAGAAACGTGAACATAAACAGTGCAACaGAGAATCTGTAATTGTGCTAAATT^ 
ACCATACACCGGTAGTTCTTTGCACrTGTATTTGTCACGTTGTCrrCTTAGACATT^ 

35 47100 
CATGGAGTGTGCTATCTGGTATGTGTATGTGTGAGTGTGTGTGTGTACaGGTGTGTGTGTGTGTGTACAC^ 

GTACCTCACACGATAGACCATACACATACACACTCACACACACACATGTCCACACACACACACACATGTGTGAAGGTTCTGTCTGTTAGCACC^ 

472 00 

AATCATCACrCTGTACTAAACCGTGTCATATGCAGGGATTGCCATCCTAATGAACACAGAAGACAATCriTTCTACTTTTAAAC 
TTAGTAGTGAGACATGATTTGGCACAGTATACGTCCCTAACGGTAGGATTACTTGTGTCTTCTGTTAGAAAGATGAAAATTTGGTTAAAGGTCA 

47300 

A ^ CAATGAATTACCAAGCCATGGATCATTCAAGGAGGTCCCTAAATTGGCCACAATCTTTACTAGACAAATAACATACTTAAACAGAGCG^ 
H J GTTACTTAATGGTTCGGTACCTAGTAAGTTCCTCCAGGGATTTAACCGGTGTTAGAAATGATCTGT^ 

47400 

TTCCArrTAATGACTGATGTTCCTAAGTAGGTGAAGATGCAAGCCTGCACATTTGGGGG^ 
AAGGTAAATTACTGACTACAAGGATTCATCCACTTCTACGTTCGGACGTGTAAACCCCCTTTAAAC^ 

47500 

GCAGGGAAGAAAGAACAGGAGGCnXMCTTACTTCAACAACACACAAGCCACCTTAM 

47600 

CCTTAGGAGCCATGGTTGTCTGCCACTGCATCrcCAAAATTGCCCAGAAGAGGTTTTAACTACCTACTTGTCTGA^^ 

GGAATCCTCGGTACCAACAGACGGTGACGTAGACGTTTTAACGGGTCrrCTCCAAAATTGATGGATGAACAGACTCTAGGTAATGTTTAG^ 

/cn 

OU GAATGAAGTAATAACCAGGGTTAGGCTGATCCCATGGACCTGGAGAGGCCAATACTCCAAAACACTAATCTTGATCATGAGCTAAGGCAAATATAA GAAA 
CTTACrTCATTATTGGTCCCAATCCGACTAGGGTACCTGGACCTCTCCGGTTATGAGGTTTTG^ 

478 00 

AOVCAACAGCAAACAGACATAGACTCTCTCAGAGCCGCAAACAGGTCGAGAGCCACTOGGAAGTGAAGTTACT^ 
TGTGTTGTCG^mTCTGTATCTGACAGAGTCTCGGCCnTTGTCa^GCrCTCGGTG^^ 

47900 

TCTG C ' XTl ' Ln ' lU ' lG GTT G ' lUn tSATATGTCTATTTAGTCTTGTTAGGGAG' rrrriXJ 

AGACGAAAGAAAACCAACAAACTATACAGATAAATCAGAACAATCCCTCAAAAACAACAAAAAAOUUVCAAAAACAAAAATTT^^ 

48000 

ATAGAATTCTrCTGATATTATGATTGAAGACTGCCTTTAATATTAACTACAATAACTAGGTATATTATCATTTAAT;^ 

TATCTTAAGAAGACTATAATACTAACTTCTGACGGAAATTATAATTGATGTTATTGATCCATATAATAGTAAATTATTCTGACAAAGAGATACATTAAT^ 
> 

I 48100 

CGGAACCTACTACTACTACTATTACTACTATAATTrACCTTCTGTGTTTAGATGGTCTTTGGGACCCT 

48200 

ACACAGACCCAGGTTGCTAATGCrAACAGCTGTAATACCCCATCCCCAGGAGGATAAAGCTATTGAATTCTAATCACAT^ 
TGTGTCTGGGTCCAACGATTACGATTGTCGACATTATGGGGTAGGGGTCCTCCTATTTCGATAACT^ 

48300 

ACTCCTATCTGATTGCTACAACACACAGCCATCCTTTCAAGAGATGACATCACTCTTCCTTTTCJ^ 
TGAGGATAGACTAACGATGTTGTGTGTCGGTAGGAAAGTTCTCTACTGTAGTGACAAGGAAAAGTTCCAAACGGGGACT 

<188R 

I 48400 
TAAAACTTACTTGGCTGGCTGTATTCCCCTAAGAGCACTATCCACAGCAAGATCAAAGTGCCAGTTAATAGCrrAATT^^ 
ATTTTGAATGAACCGACCGACATAAGGGGATTCTCGTGATAGGTGTCGTTCTAGTTTCACGGTCAATTATCGAAT^ 

48S 00 

ATCTAAAGCTCACTAAAGCTGTACrrrCTTACCCTTTGAAGGGCTTCAAATGCCAATCTGGATCT^ 



113 



TAGATTTCGAGTGATTTCGACATGAAGAATGGGAAACTTCCa3AAGTTTACG(m"AGACCTAGAAGACT^^ 

48600 

TAAGTCAATACAGGCCTCCATGGTATGTIXauVGGCAACATGTCAAGCGGGAAGCTGGAGAGAGCACC^ 
ATTCAGTTATGTCCGGAGGTACCATACAAGTTCCGTTGTACAGTTCGCCCTTCXSACCTCTCTC^ 

4B700 

AATACCATCAGTCTCGCAACAGGCTAGCAGAGGTATAAATTACTACCCTGTATAAGGATATTTAGGCCAGAAAAACCAGTTT^ 

TTATGGTAGTCAGAGCGTTGTCCGATCGTCTCCATATTTAATGATGGGACATATTCCTATAAATCCGGT C i I ' i I ' l GGTCAAAAATTTTTGTGTCACTCCT 

48800' 

ATGGAAGGGGGGAAATGATGAGAGAAGGAGCG^ 

48900 

CAGCACTGCCrrrCCCATTTGACTACTTCAAGGAAAGAAAACAATATATAATAAACACCCACGATGGC^^ 
GTCGTGACGGAAAGGGTAAACTGATGAAGTTC C ' lTrC ' lU ' ri GTTATATATTATTTGTGGGTGCTACCU'l'I'l U ^ 

49000 

AAATATCGGTTCGAAGCTGAGATAAAGTGAGGTTCTTATGAGCTAGTGGGAATTAAAGAGAAGATCAGAAATTATC^ 
TTTATAGCCAAGCTTCGACTCTATTTCACTCCAAGAATACTCGATCACCCTTAA' l ' l ' lXrirri CT 

49100 

TTGTXAGCTGTGATAGTTTATGATGTCTACAAAAACAGATGACAAAATGTGCTCGTCAGGCAGCTGAGAGAGTCACTGAGGAAATGGCTTCAGGACA^ 
AACATTCGACACrATCAAATACTACAGATGTTTTTGTCTACTGTTTTACACGAGCAGTCCGTCGACTCTCTCAGTGACTCCTTTACCGAAGTCCTC 

49200 

tgtagagggagggagcttccagaacatcaagttcctgagcagggagaaagggcaggatgagagcaagagctgggaggcacctatt;^ 



49300 




49400 

ACATCAGAAAGAGTGGCGATGGTCCAAGGCACAGACTAAAGCTGTGCATCCCTCyiCAaUiCCACCATGTCrc 
TGTAGTCTTTCTCACCGCTACC^GGTTCCGTGTCTGATTTCGACACGTAGGGAGTGTGTTGGTGGTACAGAGGGTC 

49S00 

tgctgcaaaaaccaaatgcccacatgccatctcattaaataagagtgtcgtgatgatatctataagtttgtacaccaaaaataaaagagaaagcaa^ 
acgacgtttttggtttacgggtgtacggtagagtaatttattctcacagcactactatagatattcaaacatgtggttt^ 

49600 

actttgcaaattcatactaagaacggcgggtataatctaatcttgtaaataacaacaaaaatggtagcctgcacatgtttt^ 

TGAAACGTTTAAGTATGA lUX;i ' iG CCGCCCATATTAGATTAGAACATTTATT G"l ' it; ' rri ' il 'ACCATCGGACGTGTA 

49700 

GATAATAAACTTGAACACTAACAGAAAGGTAAAAATCAATTTTATACCAACTTACAGAGACTTCCATGTCTTT^^ 
CTATTATTTGAACTTGTGATTGTCTTTCCATrrTTAGTTAAAATATGGT^ 

49800 

ctgtatctgacagcataatctgtgtagatgattataatccccaaacagatctcctacagtatggcatacctactcacagat^ 
gacatagactgtcgtattagacacatctactaatattaggggtttgtctagaggatgtcataccgtatggatgagtgtctaatacacaatcct^ 

49900 

AARAGAATCTGCAAGGTAAATCTAGCrCTGCTTGGGACGTTCGTCCCTTTAGAAGAGGGCTGATCTTTTTT^ 
TTTTCTTAGACGTTCCATTTAGATCGAGACGAACCCTGCAAGCAGGGAAATCrrTCTCCCGACTAGAAAAAAAAGGGGGACCCCCT 

5000O 

atttctaacaacggtccctacgtaatcctcaggaaacaggcagagaatttgctgctttactggtttgctttgccgt^^ 
taaagattgttgccagggatgcattaggagtccrttgtccgtcrcttaaacgacgaaat^ 

50100 

TGCAGGTATTGGTAGAGGGACGGACTGGGTCAAAGGATACACTGATACAATCCCCACGCTCTGCTTTTAT^ 
ACGTCCATAACCATCTCCCTGCCrGACCCA Gl ' in CCTATGTGACTATGTTAGGGGTGCGAGACGAAAATAACAGAACC^^ 

50200 

CCTCAAGTATTGGTGAGTCTATGTGTACCAATATCCAATGTTGAACTAAACACCAGGGTTTATACAGATATACTATAT^ 
GGAGTTCATAACCACTCAGATACACATGGTTATAGGTTACAACTTGATTTGTGGTCCCAAATATCTCTATATC 

50300 

GAGACTCAAAGGATCmSAAATGTGAGGAAAGGGAATCCCCATTTATTTGGCCACTGGGGGAAAAA^ 
CTCTGAGTTTCCTAGAACTrrACACrCCTTTCCCnnrAGGGGTAAATAAACCG 



j 50400 

cagtggctggagagatggcctagaggttatgagtgcttcagactcttcctgggttctgagttgacaagtgaactccaagtcaggaggtcacaactgcctc 
gtcaccgacctctctaccggatctccaatactcacgaagtctgagaaggacccaagactcaactgttcacttgaggttcagtcctccagtc 

5050 0 

agttctagggcaccagagcccttcttctgatctctgcaagcactgcatacctgcagtgtccattttcagagcctgagatataca^^ 



50600 

agacaacctttacacccacccagaaagactcagcgctaagggagattxxttgtccctaatgtgaatgtrrc^^ 
tctgttggaaatgtgggtgggtctttctgagtcgcgattccctctaccgacagggattaca 

N E L Y S Q> 



3 > 



<5 ' armR . 2 

I 50700 

agcaagtcacactgcaaacagcacaagtgagggacatgggaagaactctaccctgcacaacaaatttgacaccatcatcctgccagtgctttacctagtt 
tcgttcagtgtgacgtttgtcgtgttcactccctgtacccttcttgagatgggacgtgttgttta^ 
ashtanstseghgknstlhnkpdtiilpvlylv> 



>7tm 

I 50800 

atctttgtggcaagcatcctgctgaacgqtctggcogtgtggatcttcttccacact 
tagaaacaccgttcgtaggacgacttgccagaccggcacacctagaagaaggtgtaagccrtattttgotcga^ 
ifvasillnglavwipfhirnktsfipylkniv> 



50900 

tagctgacctcatcatgaccctgacattcccattccgaatagtccgtgatgcaggattcggaccttggtacttcgagtttatcctctgcagatacacctc 

atcgactggagtagtactgggactgtaagggtaaggcttatcaggovctacgtcctaagcctggaaccatgaagctcaaataggagacgtct^ 

vadlimtltfpfrivrdagfgpwyfefilcryts> 



114 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
^ 70 
75 
80 
85 
90 
95 
100 
105 
110 



51000 

TCAAAACAAGATACGTTTGTACATATGTAGATAGCACAAAGAACCCGACTAGTCACAACTAGCCATAGATTTCCACCATTTCGGGJ^ 

VLFYANMYTS I VPLGLISVDRYLKVVKPFGDSR> 



51100 

ATGTACAGCATAACCTTCACCAAAGTTTTATCAGTTTGTGTTTGGGTGATCATGGCTAT^ 

TACATGTCGTA1TGGAAGTGGTTTCAAAATAGTCAAACACAAACCCACTAGTACCGATAAGACAGGAACGGTTTGTAGTAT<^ 
MYS ITPTKVLSVCVWVIMAILSLPNI ILTNGQP> 



S1200 

CTAAGGAAAATATTCATGACTGCATGAAACTCAAAAGTCCGCTGGGAGCCAAGTGGCATATGGCTGTCACCTATGTGGACAGCTGT^ 
GATTCCnTTATAAGTACTGACGTACTTTGAGTTTTCAGGCGACCCTCGGTTCACCG^ 

TKENIHDCMKLKSPLGAKWHMAVTYVDSCLFVAV> 



51300 

GCTGGTGATCCTGATTGGATGCTACATAGCCATCTCCAGATACATCCACAAATCCAGCAGGCAATTCATAAGCCAATCGAGCCGGM 
CGACCACrAGGACTAACCTACGATGTATCGGTAGAGGTCTATGTAGGTGTTTAGOTCGTCCGTTAAGTATTCGGTTAGCT^ 

LVILIGCYIAISRYIHKSSRQFISOSSRKRKHH> 



>792P 

I 51400 
ACTTGTGCAGAATCCCCTTTACCTTCAGTAACrrAGACAGGCTTCTGG 
GTCTCGTAGGCGCACCAGCACCGACACAAAAAATGGACGAAGGAGGGCATAGTGAACACGTCTTAGGGGAAAIXXSAAGTCATTGAATC^ 
QSIRVVVAVFFTCFLPYHLCRIPFTPSNLDRLL> 



51500 

ATGAATCAGCACATAAAATCCTCTACTATTGCAAAGAAATGACA Ci"ri '' i 'CTTGTCTGCGTGCAACGTGTGCCTGGATCCGATAATTTA' 

TACTTAGTCGTGTATTTTAGGAGATGATAACGTTTCTTTACTGTGAAAAGAACAGACGCACGTTGCACACGGACCTAGGCTATTAAATA^ 

DESAHKILYYCKEMTLFLSACNVCLDPIIYFFMC> 



<1014R 

I 51600 
TAAGTCATTTTCAAGAAGGTTATTCAAGAAATCAAACATAAGAACCAGGAGCGAAAGCATCAGGTCGCTGCAAAGCGTCCGAAGA 
ATTCAGTAAAAG i ' 1 ' ClT CCAATAAG' lU'C ' l 'iU AGiTrU TATTCrrGGTCCTCGCTTTCGTAGTC 

KSFSRRLFKKSNIRTRSESIRSLQSVRRSEVRIs. 



_ 



51700 

TATTATGACTACACTGATGTGTAAGGACTAGAGGCCAGGGAGCCrrn'CrATTGGAATCAGTATGTAAAAGAGTGTAAA 
ATAATACTGATGTGACTACACATTCCTGATCTCCGGTCCCTCGGAAAAGATAACCTTAGTCATAC AinUU 'CTCACATTTATTrACAAAGA^ 
YYDYTDV*> 



I 51800 
TTGGCTGAAGCCCATCAAAAACACAAGTTTTTTGGGTTTGTTTGTTTGTTTTT^ 

AACCGACTTCGGGTAGTTTTTGTGTTCAAAAAACCCAAACAAACAAACAAAAAAATTTCCTATCTTTACTO 

3 > 

>3 ' arm mJc3 

I " 
>3 ' armF 

I 51900 

AAAAAAACATTAGGAAACAAAAAATTGTCGGATCGATTGGAGCCTGCGGAGGTGGAGGGGCAGACTGGCGAGTGGCAGGTGAACACC^ 
1 i n iU r GTAATCCTTTG' il ' rriU 'AACAGCCTAGCTAACCTCGGACGCCTCCACCTCCCCGTCTGACTO 

<hetR 

I 52000 
TGGAACTCACTCACTGCTGCTATGAAGTCACTTCTGTAGAGAACTCAAAGTATAGAGAATTGTTAAGGTTGAATATTTGAAAAAAA^ 
ACCTTGAGTGAGTGACGACGATACTTCAGTGAAGACATCTCTTGAGTTTCATATCTCTTAACAATTCCAACTTATAAA^ 

52100. 

GAATTCTAGCTACCTAGCCCCGTTTGGCCATTTCCCCTCAGTGCACITATaVATGTATaVGGC 
CTTAAGATCGATGGATCGGGGCAAACCGGTAAAGGGGAGTCACGTGAATAGTTACATAGTCCCTGTGTACATC 

52200 

AAGTCCATCCACXXrrTrCACAGTACATACTGATGCTGTCACCACCAACTAACTCATTCrAAATAAGC^ 



52300 

AAAAGTTTCTGCTATAACATCCAATTAATATTTGCCAGTATTTTCATCTTTGAGGTTTTAACAAAAATTACATAT^ 
TTTTCAAAGACGATATTGTAGGTTAATTATAAACGGTCATAAAAGTAGAAACTCCAAAATTGTTTTTAATGTATAATTGT^ 

52400 

TATGCCCAACAAGAATATCAGTAGTTTAGCrATGCAACTGTAAAATGAAAGATGTAGTTCACTAAGTTATATCAGAAAGTCCAACAGTAGTCATTTAACT 
ATACGGGTTGTTCTTATAGTCATCAAATCGATACXSTTGAC Arrin ' A CTTTCTACATCAAGtGATTC^ 

5 2500 

GTTCGGCrA' 11 ' lU ' CTCiU ' l GCCAATAATGCA C ' iU ' l ' IXS AATGACTTTGTTCCTCAAA 
CAAGCCGATAAAAGAGAAACGGTTATTACGTGAAAACTTACTGAAACAAGGAGTTT(^ 

> 

1 52600 
AAAAAAATATATTGCATTTATTTTTCGGGGGGGAGGGGATGGGATTTGTGCATGCTACCGTGCGCACGTGAATGTCAGAGGGCAA 
TTTTTTTATATAACGTAAATAAAAAGCCCCCCCTCCCCTACCCTAAACACGTACGATGGCACGCGTGCACTTACAGTCT 

52700 

AGAGAGGAGGGTTGTACACCCCAAATTCTGAATGGAGTCCAGGAGTCTGAACCTTTGTTCACGGAAATAA^ 
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52600 

TTTTTAACACGGTACTGTGATCTGCAGCAAACATTAACAGGAAGAGTTAAGATATGCTTGCTGATGAAAAGACAC^ 
AAAAATTGTGCCATGACACTAGACGTCGTTTGTAATTGTCCTTCrCAATTCTATACGAACGACTACTTTTCTGTGCCATA^ 

52900 

GGGTGAAAGAACCTGAAATCCAGATGAAGTTGGGCATGGTAGCATGCCCCTGTAATCTCCAGTCTCCTACAGCAAGACAGGAGA^ 
CCavCTTTCTTGGACTTrAGGTCTACrrcAACCCGTACCATCGTACXXXX^ 

53000 

TCCGGAAACTACAGCCCAGCa^GCTGGGCACATGCAGCAGTGAACGAGAAGGGCAAACyiCTCAAGGTTGCT 
AGGCCTTTGATGTCGGGTCGGTCGACCaSTGTACGTCGTCACTTGCTCTTCCCG'lT 

> 

I 53100 
GCATACACACACACATACACACACACACACACACACACACACACACACACACACACAATTATTAAAGCAGGCACTATAT ^ 

53200 

TATAACTGACTACTTCAGTTACGTTCTAGCAAATTGCACAATTCTGAGGCTO 
ATATTGACTGATGAAGTOU^TGCAAGATCGTTTAACGTGTTAAGACTCCGAACTCGTTTCG^ 

53300 

AGGCACAGAAGCAGTGTAAAGGTCCAGCCCACTGAGGAAACATGCCAATTTrAGCAGAGGACTTTGTATGTAAGGAGAT AAAA GA^ 
TCCGTGTCTTCGTCACATTTCCAGGTCGGGTGACTCCTTTGTACGGTTAAAATCGTCrCCTGAAACATACATTCCTCT 

53400 

CAAAATATTACTCTCTCAAGATCTCAGCTCCCTAAACAAAATGACTCTTCAGGCAAGTTAATTTTTATAGACCGTGTTTGGTC^ 
GTTTTATAATGAGAGAGTTCTAGAGTCGAGGGATTTGTTTTACTGAGAAGTCCGTTCAATTAAAAAT^ 

53500 

e^GCACCCAATGTGTGATACTAAGTAGGTCAACACTGTTTTAGGTCAGGGGATCAGAGCAGATGGAAGACTJ^ 
GTCGTGGGTTACACACTATGATTCATCCAGTTGTGACAAAATCCAGTCCCCTACTCTCGTCTACCTTCTGATT 

53600 

GCAGGATGAGACTCAGCTACCTT/JVCATGAAGGAAGAACAACAATGCTGCGGGAAAGTCCTTGGGTCCAGCCTCTCCTCTCT 
CGTCCTACTCTGAGTCGATGGAATTGTACTTCCTTCTTGTTGTTACGACGCCCTTTCyiGGAACCCAGGTCGGAGA^^ 

53700 

CAGCTCGGTGCCTAGTCAGAGCACTCAGGAACAACATTATAAGCrrCGGTACTGGGGGACATTGTTAAATGTCCTCTAGAACCrra 
GTCGAGCCACGGATCAGTCTCGTGAGTCCTTGTTGTAATATTCGAAGCCATGACCCCCTGTAACAATTTACAGGAGATCTTGGAGCCC^ 

53800 

TTCCTTCTTCCATCAGCCAGTTCTCTGCAGCACTGCAGCCCACCAGGGTTTCTAGACCAGGCTTCAGCTCAATTGTGATTCATC 
AAGGAAGAAGGTAGTCGGTCAAGAGACGTCGTGACGTCGGGTGGTCCOUUiGATCTGGTCCGAAGTCGAGTTAACACT 

53900 

TTTAAAATTATGTTAAATCTCATTTGCTCTCTCTAAGTTGGCTACCCCTGTGGTTATTGGGCACATTTAATC 

AAATTTTAATACAATTTAGAGTAAACGAGAGAGATrCAACCGATGGGGACACCAATAACCCGTGTAAATTACACTCCACCCAGAATC 

S4000 

TATTGTTAAATCAGTGGCACAACTCCAAACATCGACTTACAAAAGAGTGGTCGGACCATCCTCAGTTAAAGAAGTCAGATTATAAAATCTATCTAAATTG 
ATAACAATTTAGTCACCGTGTTGAGGTTTGTAGCTGAATGTTTTCTCACCAGCCTGGTAGGAGTCAATTTCTTCAGTCTAATAT^^ 

54100 

CAGCTCCTATTTTGCATCAATACCCCAAACCGAGTTAGCCAGAACCAACCCTGAACAACTTACAGATTTAOUVGAAGAGGTGGA 
GTCGAGGATAAAACGTAGTTATGGGGTrrGGCTCAATCGGTCTTGGTTGGGACTTCTTCAATGTCTAAATGT^^ 

> 
I 

54200 

TCCCACTCAAAGTGGAGAAATTATAAATTCTGTTTGATGGCTTTCTACAGTAACCTCAAATAGGAAACAGAATATTCCTTGGGGCTTT^ 
AGGGTGAGTTTCACCTCTTTAATATTTAAGACAAACTACCGAAAGATGTCATTGGAGTITATCCTTTGTCTTATAAGG^^ 

54300 

GTTGTGAAACCTTTATTTGAAATTGTTGTGAACAGAGAGATTTCCGATTTCAGGTGCTGGGTGTGGGAATATTTGT^ 

<3 ' armR-3 
I544OO 

GATTTTCTATATTATAATCACAAACTCATGTTGCTTCATGTGCGGCCTCCCATGCAACCCGGGAATACGATGATGCTGTGATCAGTGTGCTGTGCTCCT 
CTAAAAGATATAATATTAGTGTTTGAGTACAACGAAGTACACGCCGGAGGGTACGITGGGCCCTTATGCTACTACGACACTAGTCACAC^ 



54500 

CCCTACAGCCTATCACAGGGTGGCACTTGTGCTATCATGGCAGACTTTAGAGCCTTCAGATTTCAGATTCTAGATTAG^ 
GGGATGTCGGATAGTGTCCCACCGTGAACACGATAGTACCGTCTGAAATCTCGGAAGTCTAAAGTCTAAGATCTAATCCCTACCTATGGACGGTTTACAC 

54600 

CAGGTAAAATTTTACAATACAGGAATAAATACCATATTCATTCTTTCTTAAATAGAGAAGTTTAAAATGAAATTTTAGTAT^ 
GTCCATTTTAAAATGTTATGTCCTTATTTATGGTATAAGTAAGAAAGAATTTATCTCTTCAAA' r 

54700 

AAGAAACTGACCTCACATATAAACAGCrCAATTGATTAAACCACTTCATAACTTCTTAlTTCCCATTATAA^ 
TTCTrrGACTGGAGTGTATATTTGTCGAGTTAACTAATTTGGTGAAGTATTGAA(^ 

<3'prR.3 > 

I I 54800 

AAGCTTCAAACACAGTGCCTAGAGAAGACACATTAAATTAGATAAAATACTCAAGATAGAATGTGTGTGTGTGTGTG 
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*- > GiaLVil vilrtkklrtptniiilHLAvADLLilltlppwalyy Ivg 
H+L++++++ + +t++ +++ H vADL ++lt p+ ++ + + 
tfserSeq 59 LHGLATfflFFK-IFITKTSFIFYLKMIVVADLIMTLTFPFRITHDAeF 104 

gsedffpfGsalCJclvtaldvvrunyaSillLtalSiDRYlAIvhPlryrrr 
g W f +1C+ ++ l+++nmy+Si +L +ISiDRYl +v+P+ +r+ 
UserSeq 105 PffYFKFILCRYTSyLFYAMmTSIVFL&LISIDRYLKTVKPF&DSFtl 152 

rtsprrAkvvillvVvlalllslPplliswvktveegiigtlnvnvtvCli 
++ +kv++++vff"v+ ++lslP +++ + ++ e++ ++ C+ 

UserSeq 153 YS-ITFTK7LSyCVff7IMAVLSLPiaiLTNGQPTEDK IHDCSK 194 

dfpeestasvstwlrsyvllstlvgFllPllvilvcYtrllrtlr 

+ + + V++ + + F+ 1++++ cY I r +++++++ 

UserSeq 195 LKSPLG7 KWHTAVTYYNSCLFYAVLYILIGCYIAISRYIHkssrq 239 

kaaktllvvvvvFvlCWlPyf ivllldtlc . IsiimsstCe 

+++++++k + vvv vF+ C+lPy++ + +t+ +1+ ++ + 
UserSeq 240 f isqssrkrKHMQSIRTyTATFFTCFLPYHLCRIPFTFShLDR— LLDES 287 



UserSeq 



lervlptallvtlwLay vHsclNPilY < -* 
++1 + + +tl+L+ +N cl+PilY 
288 AQKILYYCKEITLFLSACH7CLDPIIY 314 
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FIGURE 3 
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